_ — Office de la Proprtete 
|t| Intellectuelle 
du Canada 

Un organisme 
d'lndustrie Canada 



Canadian 

Intellectual Property 
Office 

An agency of 
Industry Canada 



CA 2451492 A1 2003/01/16 

(2D 2* 451 492 

02)DEMANDE DE BREVET CANADIEN 
CANADIAN PATENT APPLICATION 

(13) A1 



(86) Date de d6p6t PCT/PCT Filing Date: 2002/07/02 

(87) Date publication PCT/PCT Publication Date: 2003/01/16 

(85) EntrSe phase nationale/National Entry: 2003/12/18 

(86) N° demande PCT/PCT Application No.: EP 2002/007281 

(87) N° publication PCT/PCT Publication No.: 2003/004659 
(30) Priority/Priority: 2001/07/04 (101 31 786.7) DE 



(51)CLInt. 7 /lnt.CI. 7 C12N 15/90, A01K 67/027, C12N 15/82, 
C12N5/00 

(71) Demandeurs/Applicants: 
SUNGENE GMBH & CO. KGAA, DE; 
INSTITUT F. PFLANZENGENETIK, DE 

(72) Inventeurs/lnventors: 
PUCHTA, HOLGER, DE; 
BIESGEN, CHRISTIAN, DE . 

(74) Agent: ROBIC 



(54)Titre : SYSTEMES DE RECOMBINAISON ET PROCEDES POUR RETIRER DES SEQUENCES D'ACIDE 

NUCLEIQUE DU GENOME D'ORGANISMES EUCARYOTES 
(54) Title: RECOMBINATION SYSTEMS AND A METHOD FOR REMOVING NUCLEIC ACID SEQUENCES FROM THE 

GENOME OF EUKARYOTIC ORGANISMS 



(57) Abr6g6/Abstract: 

The invention relates to recombination systems and to a method for removing nucleic acid sequences from the chromosomal 
DNA of eukaryotic organisms. The invention also relates to transgenic organisms (preferably plants), containing said systems or 
produced by said method. 



7\ Tl <\ C\ PI http://opic.gc.ca • Ottawa-Hull Kl A 0C9 • http://cipo.gc.ca 

^ ^ OPIC CIPO 191 



OPIC 




CIPO 



CA 02451492 2003-12-18 



1 

RECOMBINATION SYSTEMS AND A METHOD FOR REMOVING NUCLEIC 
ACID SEQUENCES FROM THE GENOME OF EUKARYOTIC ORGANISMS 

5 The invention relates to recombination systems and methods for 
eliminating nucleic acid sequences from the genome of eukaryotic 
organisms, and to transgenic organisms — preferably plants — 
comprising these systems. 

10 The purpose of biotechnological research into organisms consists 
in, inter alia, obtaining commercially utilizable informationen 
on the function of certain genes and gene products and the 
elucidation of biosynthetic pathways or disease mechanisms. The 
information obtained in this manner can be employed in a 

15 multiplicity of ways. They serve for example for the production 
of novel medicaments, the development of alternative, 
biotechnological production methods or the generation of modified 
plants. An aim of biotechnological research into plants is the 
generation of plants with advantageous novel characteristics, for 

20 example for increasing agricultural productivity, improving the 
quality in foodstuffs or for the production of certain chemicals 
or pharmaceuticals (Dunwell JM, J Exp Bot. 2000; 51 Spec 
No:487-96). 

25 In the generation of transgenic organisms, selection of the 
organisms which have been modified in the desired manner is 
required owing to the poor efficacy of the methods used (such as, 
for example, stable transformation or, in particular, homologous 
recombination). Transgenic plants can be generated by a series of 

30 techniques (Review: Potrykus I. and Spangenberg G. ed. (1995) 
Gene transfer to plants. Springer, Berlin). In particular the 
gene transfer mediated by Agrobacterium tumefaciens and the 
bombardment of plant cells with the particle gun play an 
important role in this context . An important problem is the fact 

35 that transgenic DNA, once stably introduced into an organism, can 
only be removed with difficulty. The genes for resistance to 
antibiotics or herbicides, which are used during the 
transformation procedure for selection purposes, remain in the 
transgenic plants, which contributes substantially to the lack of 

40 acceptance of these ''gene food" products among consumers. 
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It has therefore been attempted for some time to develop 
techniques by means of which foreign DNA can be integrated into 
the plant genome at the specific sites or reexcised therefrom (Ow 
DW and Medberry SL (1995) Crit Rev in Plant Sci 14:239-261). 
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The skilled worker is familiar with a variety of systems for the 
site-directed removal of recombinant ly introduced nucleic acid 
sequence^. They are based on the use of sequence— specif ic 
recombinases and two recognition sequences of said recombinases 
5 which flank the sequence to be removed. The effect of the 
recombinase on this construct brings about the excision of the 
flanked sequence, one of the recognition sequences remaining in 
the genome of the organism. Various sequence-specific 
recombination systems are described, such as the Cre/lox system 

10 of the bacteriophage PI (Dale EC and Ow DW (1991) Proc Natl Acad 
Sci USA 88:10558-10562; Russell SH et al. (1992) Mol Gene Genet 

. 234: 49-59? Osborne BI et al. (1995) Plant J. 7, 687-701), the 
yeast FLP/FRT system (Kilby NJ et al. (1995) Plant J 8:637-652; 
Lyznik LA et al. (1996) Nucleic Acids Res 24:3784-3789), the Mu 

15 phage Gin recombinase, the E. coli Pin recombinase or the R/RS 
system of the plasmid pSRl (Onouchi H et al.{1995) Mol. Gen. 
Genet. 247 :653-660 . ; Sugita Ket al. (2000) Plant J. 22:461-469). 
Here, the recombinase (for example Cre or FLP) interacts 
specifically with its corresponding recombination sequences (34 

20 bp lox sequence and 47 bp FRT sequence, respectively) in order to 
delete or invert the interposed sequences. Reports on successful 
applications of these systems in plants are limited. Thus, David 
Ow f s group has demonstrated that a selection marker used for the 
transformation of plants which was flanked by two lox sequences 

25 can be reexcised from the plant genome by the expression of Cre 
(Dale EC and Ow DW (1991) Proc Natl Acad Sci USA 88:10558-10562). 
A disadvantage of the sequence-specific recombination systems is 
the reversibility of the reaction, that is to say an equilibrium 
exists between excision and integration of the marker gene in 

30 question. This frequently brings about the selection of 
mutations, i.e. as soon as a mutation blocks the further 
interaction of the lox recognition sequences with the enzyme, the 
(undesired) product is removed from the equilibrium and fixed. 
This not only applies to the Cre-lox system, but also to the 

35 other sequence-specific recombinases (see above). A further 

disadvantage is the fact that one of the recognition sequences of 
the recombinase remains in the genome, which is thus modified. 
This may have effects on the characteristics of the organisms 
when, for example, the recognition sequence modifies or destroys 

40 reading frames or genetic control elements such as promotors or 
enhancers. Furthermore, the recognition sequence which remains in 
the genome excludes a further use of the recombination system, 
for example for a second genetic modification, since interactions 
with the subsequently introduced recognition sequences cannot be 

45 ruled out. Substantial chromosomal rearrangements or deletions 
may result. 
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Zubko et al. describe a system for the deletion of nucleic acid 
sequences from the tobacco genome, where the sequence to be 
deleted is flanked by two 352 bp attp recognition sequences from 
the bacteriophage Lambda . Deletion of the flanked region takes 
5 place independently of the expression of helper proteins in two 
of eleven transgenic tobacco lines by spontaneous 
intrachromosomal recombination between the attP recognition 
regions. The disadvantages of this method are that recombination, 
or deletion, cannot be induced specifically at a particular point 
10 in time, but takes place spontaneously. The fact that the method 
worked only in a small number of lines suggests that the 
integration locus in the cases in question tends to be unstable 
(Puchta H (2000) Trends in Plant Sci 5:273-274). 

15 On page 12 in the key to Figure 32, wo 96/14408 describes a 

method for eliminating a genetic locus in which in each case one 
recognition sequence of the homing restriction endonuclease 
I-Scel is inserted at the respective end of the sequence to be 
deleted. Treatment with the endonuclease leads to dbuble-strand 

20 breaks at both ends of the sequence to be deleted. The free ends 
then join up by means of "recombination". The "recombination" 
cited here can only be an illegitimate recombination — as can 
also be seen from the Figure - (for example a non-homologous 
end- joining (NHE J) event), since no homologous sequences exist at 

25 the two remaining ends of the genomic DNA. Illegitimate 

recombination, however, leads to unpredictable recombination 
events. This may have effects on the characteristics of the 
organisms if for example reading frames or genetic control 
elements such as promotors or enhancers are modified or destroyed 

30 thereby. The system requires two recognition sequences which 
flank the fragment to be deleted. 

The generation of sequence-specific double-strand breaks with the 
aid of restriction enzymes in eukaryotic genomes such as yeast 
35 (Haber JE (1995) Bioassays 17 : 609-620 ), mammalian cells ( Jasin M 
(1996) Trends Genet. 12:224-228) or plants (Puchta H (1999a) 
Methods Mol Biol 113:447-451) is described. 

What is described is the induction of an intramolecular 
40 recombination on a plasmid DNA in Xenopus oocytes by 

sequence-specific cleavage with the endonuclease I-Scel (Segal DJ 
and Caroll D (1995) Proc Natl Acad Sci USA 92:806-810) or by 
synthestic, chimeric nucleases (Bibikova M et al. (2001) Mol Cell 
Biol 21(1) :289-297) . The aim is the site-directed recombination 
45 between two homologous sequences between which a suitable 
nuclease cleavage site is located. Both cases are 
extrachromosomal recombination events in which in each case only 
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part of the extra chromosomal plasmid DNA undergoes homologous 
recombination. 

Posfai et al. describe a method for exchanging genes in the 
5 prokaryote E.coli (Posfai G et al* (1999) Nucleic Acids Res 

27(22) :4409-4415) . Here, recombination between the endogenous and 
the mutated gene results in the E.coli genome, induced by 
cleavage with the restriction enzyme I-Scel. Aim and object was 
the exchange of an endogenous gene for a mutated transgene. 
10 Recombinations in E.coli proceed in a markedly simpler way and 
with greater efficacy than in higher eukaryotes (for example 
described by Kuzminov A (1999) Microbiol Mol Biol Rev. 
63(4):751-813). 

15 Diirrenberger et al. describe the induction of recombination in 
chloroplasts of the single-celled green alga Chlamydomonas 
reinhardtii using the I-Scel homing endonuclease (Diirrenberger F 
et al. (1996) Nucleic Acid Res 24(17) :3323-3331) . Recombination 
takes place between the endogenous 23S gene and an inserted 23S 

20 cbNA which contains a I-Scel cleavage site. Double-strand breaks 
are induced by mating the transgenic organism in question with an 
organism expressing I-Scel. Recombinations in chloroplasts 
proceed in a markedly simpler manner and with greater efficacy 
than in the chromosomal DNA of higher eukaryotes. Thus, indeed, 

25 homologous recombination appears to be the preferred, normal way 
of DNA integration in plastids (chloroplastes) (described in: 
Heifetz PB and Tuttle AM (2001) Curr Opinion Plant Biol 
4; 157-161). It appears that plastids have a specific system which 
enables them to undergo homologous recombination, as opposed to 

30 the nucleus, and facilites the site-directed introduction of 
foreign DNA (Heifetz PB (2000) Biochimie 82:655-666). 

The gene targeting technique, in which a site-directed 
integration into the chromosomal DNA of the host organism is to 

35 be achieved by means of homologous recombination works acceptably 
well only in the case of prokaryotes and yeast. The generation of 
corresponding transgenic organisms is possible in a few species 
only (such as, for example, mice) and even then highly 
complicated (see also Kanaar R Hoeijmakers JH (1997) Genes Funct 

40 1(3) :165-174) . The existing, poor homologous recombination 

efficacy (approx. I:lxl0 6 ) is compensated for in this case by the 
use of complicated, sophisticated selection techniques which are 
limited to the species in question (such as, for example, "ES" 
cell technology) . In other species — but above all in Higher 

45 Plants — such technologies have not been established as yet 
(Mengiste T and Paszkowski J (1999) Biol Chem. 380:749-758; 
Vergunst AC and Hooykaas PJJ (1999) Crit Rev Plant Sci 18:1-31; 
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Puchta H (1999) Methods Mol Biol 113:447-451; Hohn B and Puchta H 
(1999) Proc Natl Acad Sci USA 96:8321-8323). Attempts to achieve 
homologous recombination in plants resulted in random, 
nonhomologous " illegitimate " insertion events in most cases. 
5 Here r the DNA introduced is integrated at one or more 

unpredictable sites in the plant genome. Integration takes place 
by means of illegitimate recombination (Roth DB and Wilson JH 
(1988) Illegitimate recombination in mammalian cells. In "Genetic 
recombination", R. Kucherlapati and G.R. Smith Edts., American 

10 Society of Micorbiology [sic] f Washington, USA; pp. 621-635) and 
not in sequence regions which are homologous to the transferred 
DNA (Puchta H and Hohn B (1996) Trends Plant Sci. 1:340-348). The 
problem of lacking efficacy in homologous recombination, which is 
serious predominantly in plants, is generally known to the 

15 skilled worker. The causes are addressed by current research 
(Review article: Mengiste T and Paszkowski J (1999) Biological 
Chemistry 380(7-8) :749-58) • Increasing the efficacy of homologous 
recombination has long been a need in plant biotechnology which 
is hitherto unresolved. 

20 

A further need which has long existed in biotechnological 
research and which is not addressed by any of the established 
systems is the provision of systems and methods which enables the 
site-directed elimination of nucleic acid sequences from the 

25 chromosomal DNA of a eukaryotic organism and allow the repeated 
application to the same organism. For example, it is an aim of 
plant biotechnology further to improve by means of recombinant 
methods existing high-yielding varieties. In this context, it is 
particularly important to eliminate, after the transformation has 

30 taken place, superfluous transgene sequences such as selection 
markers • In addition, methods for the predictable elimination of 
sequences, for example from the chromosomal DNA of an organism, 
would offer further applications in the field of genetic 
engineering which are of great interest scientifically and 

35 economically. 

It is an object of the present invention to develop systems and 
methods which enable the predictable elimination of defined 
nucleic acid sequences from the chromosomal DNA of a eukaryotic 
40 organism and allow the repeated, successive application to the 
same organism. 

We have found that this edition [sic] has been achieved in a 
surprising manner by providing the recombination system according 
45 to the invention. 
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A first subject matter of the invention relates to a 
recombination system for eliminating a DNA sequence from the 
chromosomal DNA of a eukaryotic cell or organism, which 
comprises, in a eukaryotic cell or organism, 

5 

I) a transgenic recombination construct inserted into the 
chromosomal DNA of a eukaryotic organism comprising a 
sequence consisting, in the 5 ' /3 '-direction , of 

10 al) a first homology sequence A and 

bl) at least one recognition sequence for the site-directed 
induction of DNA double-strand breaks and 

15 a2) a second homology sequence B, the homology sequences A 

and B having a sufficient length and sufficient homology 
in order to ensure homologous recombination, 

together with 

20 

II) an enzyme suitable for inducing DNA double-strand breaks at 
the recognition sequence (bl) for the site-directed induction 
of DNA double-strand breaks or a nucleic acid sequence 
encoding an enzyme suitable for inducing DNA double-strand 

25 breaks at the recognition sequence (bl). 

A further subject matter of the invention relates to a method for 
eliminating a DNA sequence from the chromosomal DNA of a 
eukaryotic cell or organism, which comprises combining, in a 
30 eukaryotic cell or organism, 

I) a transgenic recombination construct inserted into the 
chromosomal DNA of a eukaryotic organism comprising a 
sequence consisting, in the 5 ' /3 ' -direction, of 

35 

al) a first homology sequence A and 

bl) at least one recognition sequence for the site-directed 
induction of DNA double-strand breaks and 

40 

a2) a second homology sequence B, the homology sequences A 
and B having a sufficient length and sufficient homology 
in order to ensure homologous recombination, 

45 together with 
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II) an enzyme suitable for inducing DNA double-strand breaks at 
the recognition sequence (bl) for the site-directed induction 
of DNA double-strand breaks, 

5 and the induction of DNA double-strand breaks at the recognition 
sequence for the site-directed induction of DNA double-strand 
breaks and the homologous recombination taking place between the 
homology sequences A and B. 

10 The invention enables sequences (for example selection markers 
such as genes for resistance to antibiotics or herbicides) to be 
deleted from the chromosomal DNA of an organism in an accurately 
predictable manner. In doing so, the sequence to be eliminated is 
flanked by recognition sequences for the site-directed induction 

15 of DNA double-strand breaks (for example recognition sequences of 
rare-cleaving restriction enzymes) and combined with homologous 
sequences in the region of the cleavage sites. A double-strand 
break is induced by an enzyme suitable for inducing DNA 
double-strand breaks at the recognition sequence for the 

20 site-directed induction of DNA double-strand breaks, (for example 
a sequence-specific nuclease), which, in consequence, triggers 
the homologous recombination of homologous sequences located at 
the break, and thus the deletion of any nucleic acid sequences 
located between the sequences. The recognition sequence for the 

25 site-directed induction of DNA double-strand breaks is likewise 
deleted, and the method can thus be used repeatedly for further 
controlled genetic modifications. 

Surprisingly, this induced homologous recombination takes place 
30 with high efficacy and precision, which is in contrast to 
previous experience in the field of homologous recombination, 
including in plants. The frequency can be compared with the 
parallel, nonhomologous events (for example non-homologous 
end- joining events) (cf. Example 5). This is a remarkable finding 
35 which is in contrast to earlier observations, according to which 
the frequency of homologous recombination — above all in the case 
"of plants — is secondary, almost negligible, in comparison with 
the "illegitimate" events. 

40 The sequences which are deleted are those located between the 
homology sequences A and B. In contrast to systems such as, for 
example, the cre/lox or the FRT/FLP system, one is not bound to 
specific sequences when performing recombination. The skilled 
worker knows that any sequence can undergo homologous 

45 recombination with another sequence provided that sufficient 
length and homology exist. Owing to the sequence-specific 
induction of the double-strand breaks, the homologous 
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recombination efficacy between the homology sequences A and B is 
increased considerably, indeed enabled in the first place in some 
cases. 

5 With regard to the recombination construct, "transgene" refers to 
all those constructs which are the result of recombinant methods 
in which either 

a) at least one of the homology sequences A or B, or 

10 

b) at least one recognition sequence for the site directed 
induction of DNA double-strand breaks, or 

c) (a) and (b) 

15 

are not located in their natural genetic environment (for example 
at their natural chromosomal locus) or have been modified by 
recombinant methods, it being possible for the modification to 
encompass, for example, substitutions, additions, deletions, 
20 inversion [sic] or insertions of one or more nucleotide residues. 

"Eukaryotic cell or organism" generally refers to any eukaryotic 
cell or organism and to cells, tissues, parts or propagation 
material (such as seeds or fruits) derived from these in which an 

25 induction of double-strand breaks may take place at the 

recognition sequence for the site-directed induction of DNA 
double-strand breaks and the homologous recombination between the 
homology sequences A and B may take place while the recombination 
construct and the enzyme suitable for inducing DNA double-strand 

30 breaks at the recognition sequence for the site-directed 

induction of DNA double-strand breaks are simultaneously present 
in one reaction space (for example in a cell or cell 
compartment) . A particularly preferred embodiment encompasses 
compartments of a eukaryotic cell such as, for example, the 

35 nucleus. 

Cells or organisms which are especially preferably encompassed 
are those which constitute a multi-celled eukaryotic organism or 
are derived from the latter, and cells, tissues, parts or 

40 propagation material (such as seeds or fruits) of same. Very 
especially preferably encompassed cells or organisms are those 
which constitute an animal or plant organism or are derived from 
the latter, and cells, tissues, parts or propagation material of 
same. Most preferably encompassed cells or organisms are those 

45 which constitute a plant organism or are derived from the latter. 
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and cells, tissues, parts or propagation material of same. 
Preferred genera and species are detailed further below. 

Referring to the homology sequences A and B, ''sufficient length" 
5 preferably refers to sequences with a length of at least 20 base 
pairs, preferably at least 50 base pairs, especially preferably 
at least 100 base pairs, very especially preferably at least 250 
base pairs, most preferably at least 500 base pairs. 

10 Referring to the homology sequences A and B r "sufficient 
homology" preferably refers to sequences with at least 70%, 
preferably 80% , by preference at least 90%, especially preferably 
at least 95%, very especially preferably at least 99%, most 
preferably 100%, homology within these homology sequences over a 

15 length of of [sic] at least 20 base pairs, preferably at least 50 
base pairs, especially preferably at least 100 base pairs, very 
especially preferably at least 250 base pairs, most preferably at 
least 500 base pairs. 

20 Homology between two nucleic acid sequences is understood as 
meaning the identity of the nucleic acid sequence over in each 
case the entire sequence length which is calculated by alignment 
with the aid of the program algorithm GAP (Wisconsin Package 
Version 10.0, University of Wisconsin, Genetics Computer Group 

25 (GCG), Madison, USA), setting the folowing parameters: 

Gap Weight: 12 Length Weight: 4 

Average Match: 2,912 Average Mismatch: -2, 003 

30 

In a preferred embodiment, only one recognition sequence for the 
site-directed induction of DNA double-strand breaks is located 
between the homology sequences A and B, so that the recombination 
construct employed in the recombination system or method 
35 according to the invention is constructed in the 5 ' /3 ' -direction 
as follows: 

al) a first homology sequence A and 

40 bl) a recognition sequence for the site-directed induction of 
DNA double-strand breaks and 

a2) a second homology sequence B, the homology sequences A 
and B having a sufficient length and sufficient homology 
45 in order to ensure homologous recombination. 
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In a preferred embodiment, a further nucleic acid sequence is 
located between the homology sequences A and B, so that the 
recombination construct employed in the recombination system or 
method according to the invention is constructed as follows in 
5 the 5 '/3' -direction of 

al) a first homology sequence A and 

bl) a recognition sequence for the site-directed induction of 
10 DNA double-strand breaks and 

c) a further nucleic acid sequence and 

a2) a second homology sequence B, the homology sequences A 
15 and B having a sufficient length and sufficient homology 

in order to ensure homologous recombination. 

The recognition sequence for the site-directed induction of DNA 
double-strand breaks may also be located after or within the 
20 further nucleic acid sequence. 

In a further preferred embodiment, a second recognition sequence 
for the site-directed induction of double-strand breaks is 
present after the further nucleic acid sequence. This embodiment 

25 is advantageous in particular in the case of homology sequences A 
and B which are further apart, or in the case of longer further 
nucleic acid sequences, since recombination efficacy is 
increased. In this embodiment, the recombination construct 
employed in the recombination system or method according to the 

30 invention is constructed as follows in 5'/3' direction of 

al) a first homology sequence A and 

bl) a first recognition sequence for the site-directed 
35 induction of DNA double-strand breaks and 

c) a further nucleic acid sequence and 

b2) a second recognition sequence for the site-directed 
40 induction of DNA double-strand breaks and 

a2) a second homology sequence B, the homology sequences A 
and B having a sufficient length and sufficient homology 
in order to ensure homologous recombination. 

45 
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Furthermore, other recognition sequences may also be present 
between the homology sequences A and B, in addition to the second 
recognition sequences for the site-directed induction of DNA 
double-strand breaks. The individual recognition sequences (for 
5 example bl or b2) for the site-directed induction of DNA 

double-strand breaks may be identical or different, i.e. they may 
act as recognition sequence for ah individual enzyme for the 
site-directed induction of DNA double-strand breaks or else for a 
variety of enzymes. Here, the embodiment in which the recognition 
10 sequences for the site-directed induction of DNA double-strand 
breaks act as recognition sequence for an individual enzyme for 
the site-directed induction of DNA double-strand breaks is 
preferred in this context. 

15 The skilled worker is familiar with a variety of ways to obtain 
one of the recombination constructs according to the invention. 
They can be prepared by means of customary recombination and 
cloning techniques as are described, for example, in T. Maniatis, 
E.F. Fritsch and J. Sambrook, Molecular Cloning: A Laboratory 

20 Manual, Cold Spring Harbor Laboratory, Cold Spring Harbor, NY 
(1989), in T,J; Silhavy, M.L. Berman and L.W. Enquist, 
Experiments with Gene Fusions, Cold Spring Harbor Laboratory, 
Cold Spring Harbor, NY (1984) and in Ausubel, F.M. et al., 
Current Protocols in Molecular Biology, Greene Publishing Assoc. 

25 and Wiley Interscience (1987). 

Preferably, the recombination construct according to the 
invention is generated by joining the abovementioned essential 
constituents of the recombination construct together in the 
30 abovementioned sequence using the recombination and cloning 
techniques with which the skilled worker is familiar, and the 
result is then introduced into the chromosomal DNA of a host 
organism. 

35 However, the skilled worker is aware that he may also obtain the 
recombination construct according to the invention in other ways. 
Thus, the host organism may already comprise one or more of the 
essential components of the recombination construct. The 
recombination construct according to the invention is then 

40 generated by introducing one further, or more, essential 

components of the recombination construct in the correct position 
relative to the existing components in said organism. Thus, for 
example, the starting organism may already comprise one of the 
homology sequences A or B. If the organism already comprises a 

45 homology sequence A, introducing a construct consisting of a 
recognition sequence for the site-directed induction of DNA 
double-strand breaks and a second homology sequence B after the 
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homology sequence A gives rise to one of the recombination 
constructs according to the invention. 

Furthermore, the skilled worker is familiar with various ways in 
5 which the recombination construct according to the invention may 
be introduced into the chromosomal DNA of a eukaryotic cell or 
organism. In this context, the insertion may be directed (i.e. 
taking place at a defined insertion site) or undirected (i.e. 
taking place randomly) . Suitable techniques are known to the 
10 skilled worker and described by way of example hereinbelow. 

"Enzyme suitable for inducing DNA double-strand breaks an the 
recognition sequence for the site-directed induction of DNA 
double-strand breaks" (hereinbelow "DSBI enzyme", which stands 
15 for "double strand -break inducing enzyme ") generally refers to 
all those enzymes which are capable of generating double-strand 
breaks in double stranded DNA in a sequence-specific manner. The 
following may be mentioned by way of example, but not by 
limitation: 

20 

1. Restriction endonucleases (type II), preferably homing 
endonucleases as described in detail hereinbelow. 

2. Recombinases (such as, for example, Cre/lox; R-RS; FLP/FTR as 
25 described above) 

3. Transposases, for example the P-element transposase (Kaufman 
PD and Rio DC (1992) Cell 69(l):27-39) or AcDs (Xiao YL and 
Peterson T (2000) Mol Gen Genet 263( 1 ) :22-29) . In principle, 

30 all transposases or integrases are suitable as long as they 
have sequence specificity (Haren L et al. (1999) Annu Rev 
Microbiol. 1999;53:245-281; Beall EL, Rio DC (1997) Genes 
Dev. 11(16) :2137-2151). 

35 4. Chimeric nucleases as described in detail hereinbelow. 

5. Enzymes which induce double-strand breaks in the immune 
system, such as the RAG1/RAG2 system (Agrawal A et al. (1998) 
Nature 394(6695) :744-451) . 

40 

6. Group II intron endonucleases. Modifications of the intron 
sequence allows group II introns to be directed to virtually 
any sequence in a double-stranded DNA, where group II introns 
can subsequently insert by means of a reverse splice 

45 mechanism (Mohr et al- (2000) Genes & Development 14:559-573; 

Guo et al. (2000) Science 289:452- 457). During this reverse 
splice mechanism, a double-strand break is introduced into 
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the target DNA, the excised intron RNA cleaving the sense 
strand while the protein portion of the group II intron 
endonuclease hydrolyses the antisense strand (Guo et al. 
(1997) EMB0 J 16: 6835- 6848). If it is only desired to 
5 induce the double-strand break without achieving complete 

reverse splicing, as is the case in the present invention, it 
is possible to resort to, for example, group II intron 
endonucleases which lack the reverse transcriptase activity. 
While this does not prevent the generation of the 
10 double-strand break, the reverse splicing mechanism cannot 
proceed to completion. 

Suitable enzymes are not only natural enzymes, but also synthetic 
enzymes. 

15 

Preferred enzymes are all those DSBI enzymes whose recognition 
sequence is known and which can either be obtained in the form of 
their proteins (for example by purification) or expressed using 
their nucleic acid sequence. 

20 

Especially preferred are restriction endonucleases (restriction 
enzymes ) which have no or only a few recognition sequences — 
besides the recognition sequences present in the transgenic 
recombination construct - in the chromosomal DNA sequence of a 
25 particular eukaryotic organism. This avoids further double-strand 
breaks at undesired loci in the genome. 

This is why homing endonucleases are very especially preferred 
(Review: (Belfort M and Roberts RJ (1997) Nucleic Acids Res 25: 
30. 3379-3388; Jasin M (1996) Trends Genet. 12:224-228; Internet: 
http://rebase.neb.com/rebase/rebase.homing.html). Owing to their 
long recognition sequences, they have no, or only a few, further 
recognition sequences in the chromosomal DNA of eukaryotic 
organisms in most cases. 

35 

The sequences encoding for such homing endonucleases can be 
isolated for example from the chloroplast genome of Chlamydomonas 
(Tunnel M et al. (1993) J Mol Biol 232: 446-467). They are small 
(18 to 26 kD) and their open reading frame (ORF) has a " coding 

40 usage" which is suitable directly for nuclear expression in 
eukaryotes (Monnat RJ Jr et al. (1999) Biochem Biophys Res Com 
255:88-93). Homing endonucleases which are very especially 
preferably isolated are the homing endonucleases I-Scel 
(WO96/14408), I-Scell (Sarguiel B et al. (1990) Nucleic Acids Res 

45 18:5659-5665), I-SceIII (Sarguiel B et al. (1991) Mol Gen Genet. 
255:340-341), I-Ceul (Marshall (1991) Gene 104:241-245), I-Crei 
(Wang J et al. (1997) Nucleic Acids Res 25: 3767-3776), I-Chul 
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(Cote V et al.(1993) Gene 129:69-76) , I-TevI (Chu et al. (1990) 
Proc Natl Acad Sci USA 87:3574-3578; Bell-Pedersen et al. (1990) 
Nucleic Acids Resl8:3763-3770) , I-TevII (Bell-Pedersen et al. 
(1990) Nucleic Acids Resl8 : 3763-3770 ) , I-TevIII (Eddy et al. 
5 (1991) Genes Dev. 5:1032-1041), Endo Seel (Kawasaki et al. (1991) 
J Biol Chem 266:5342-5347), I-Cpal (Turmel M et al. (1995a) 
Nucleic Acids Res 23:2519-2525) and I-Cpall (Turmel M et al. 
(1995b) Mol. Biol. Evol. 12, 533-545). 

10 Further homing endonucleases are detailed in the abovementioned 
Internet website, and examples which may be mentioned are homing 
endonucleases such as F-Scel, F-Scell, F-Suvl, F-TevI, F-TevII, 
I-Amal, I-Anil, I-Ceul, I-CeuAIIP, I-Chul, I-Cmoel, I-Cpal, 
I-CpaII, I-Crel, I-CrepsbIP, I-CrepsbIIP, 1-CrepsbIIIP, 

15 l-CrepsbIVP, I-Csml, I-Cvul, 1-CvuAlP, I-Ddil, I-DdiII, I -Dirt, 
I-Dmol, I-Hmul, I-Hmull, I-HspNIP, I-Llal, I-Msol, I-Naal, 
I-NanI, 1-NclIP, I-NgrIP, I-Nitl, I-Njal, I-Nsp236lP, I-PakI, 
I-PboIP, I-PcuIP, I-PcuAI, I-PcuVI, I-PgrIP, I-PobIP, 1-Porl, 
I-PorIIP, 1-PpblP, I-Ppol, 1-SPBetaIP, I-Scal, I-Scel, I-SceII, 

20 I-Scelll , I-SceIV, I-SceV, I-SceVI, I-SceVH, l-SexIP r I-SneIP, 
I-SpomCP, I-SpomIP, I-SpomIIP, I-SquIP, I-Ssp6803l, I-SthPhiJP, 
I-SthPhiST3P, I-SthPhiS3bP, I-TdeIP, I-TevI, I-TevII, I-TevIII, 
I-UarAP, I-UarHGPAlP, I-UarHGPA13P, I-VinIP, I-ZbiIP, PI-MtuI, 
PI-MtuHIP, PI-MtuHIIP, Pl-Pful, Pl-PfuII, Pl-Pkol, Pl-PkoII, 

25 PI-PspI, PI-Rma43812IP, PI-SPBetalP, Pl-Scel, PI-Tful, PI-TfuII, 
PI-Thyl, PI-Tlil, PI-Tlill. 

Preferred in this context are the homing endonucleases whose gene 
sequences are already known, such as, for example, 
30 F-Scel,I-CeuI,I-ChuI, I-Dmol, I-Cpal, I-CpaII, I-Crel, I-Csml, 
F-TevI, F-TevII, I-TevI, I-TevII, I-Anil, I-Cvul, I-Ddil, I-Hmul, 
I-HmuII, I-Llal, I-NanI, I-Msol, I-Nitl, I-Njal, I-PakI, I-Porl, 
I-Ppol, I-Scal, I-Ssp6803l, Pl-Pkol, Pl-PkoII, PI-PspI, PI-Tful, 
PI-Tlil. 

Very especially preferred are commercially available homing 
endonucleases such as I-Ceul, I-Scel, I-Dmol, I-Ppol, PI-PspI or 
Pl-Scel. 

40 The enzymes can be isolated from their organisms of origin in the 
manner with which the skilled worker is familiar, and/or their 
coding nucleic acid sequence can be cloned. The sequences of 
various enzymes are deposited in GenBank. 

45 Very especially preferred are the homing endonucleases I-Scel, 
I-Cpal, I-CpaII, I-Crel and I-Chul. Most preferred are the homing 
endonucleases as shown in SEQ ID NO: 2, 4, 6, 8 or 10. 
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Synthetic DSBI enzymes which may be mentioned by way of example 
are chimeric nucleases which are composed of an unspecific 
nuclease domaine and a sequence-specific DNA binding domaine 
consisting of zinc fingers (Bibikova M et al. (2001) Mol Cell 
5 Biol. 21:289-297). These DNA-binding zinc finger domaines can be 
adapted to suit any DNA sequence. Suitable methods for preparing 
suitable zink finger domaines are described and known to the 
skilled worker (Beerli RR et al., Proc Natl Acad Sci USA. 2000; 
97 (4):1495-1500; Beerli RR, et al., J Biol Chem 2000; 

10 275(42) :32617-32627; Segal DJ and Barbas CF 3rd., Curr Opin Chem 
Biol 2000; 4(1) : 34-39; Rang JS and Kim JS, J Biol Chem 2000; 
275(12) :8742-8748; Beerli RR et al., Proc Natl Acad Sci USA 1998; 
95(25):14628-14633; Kim Js' et al., Proc Natl Acad Sci USA 1997; 
94(8) :3616-3620; Klug A, J Mol Biol 1999; 293 ( 2 ) :215-218 ; Tsai SY 

15 et al., Adv Drug Del iv Rev 1998;30( 1-3) :23-31; Mapp AK et al., 
Proc Natl Acad Sci USA 2000; 97(8) :3930-3935; Sharrocks AD 
et al., Int J Bioc hem Cell Biol 1997; 29(12) :1371-1387; Zhang L 
et al., J Biol Chem 2000; 275(43) :33850-33860) . 

20 The DSBI enzyme is preferably expressed as a fusion protein with 
a nuclear localization sequence (NLS). This NLS sequence enables 
facilitated transport into the nucleus and increases the efficacy 
of the recombination system. A variety of NLS sequences are known 
to the skilled worker and described, inter alia, by Jicks GR and 

25 Raikhel NV (1995) Annu. Rev. Cell Biol. 11:155-188. Preferred for 
plant organisms is, for example, the NLS sequence of the SV40 
large antigen. Very especially preferred are the following NLS 
sequences: 

30 NLS1: N-Pro-Lys-Thr-Lys-Arg-Lys-Val-C (SEQ ID NO: 29) 

NLS2: N-Pro-Lys-Lys-Lys-Arg-Lys-Val-C (SEQ ID NO: 30) 

The homing endonucleases as shown in SEQ ID NO: 4, 6, 8 or 10 
35 used in the use examples are fusion proteins of the native 
nucleases and the NLS2 nuclear localization sequence. 

Owing to the small size of many DSBI enzymes (such as, for 
example, the homing endonucleases) , an NLS sequence is not 
40 necessarily required. These enzymes are capable of passing 

through the nuclear pores even without any aid. This is confirmed 
by the efficacy of the homing endonuclease as shown in SEQ ID NO: 
2 which has been used and which encompasses no nuclear 
localization sequence. 

45 
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In a further preferred embodiment, the activity of the DSBI 
enzyme can be induced. Suitable methods have been described for 
sequence-specific recombinases (Angrand PO et al. (1998) Nucl. 
Acids Res. 26(13) : 3263-3269; Logie C and Stewart AF (1995) Proc 
5 Natl Acad Sci USA 92(13) :5940-5944; Imai T et al. (2001) Proc 
Natl Acad Sci USA 98 ( 1 ): 224-228 ) . These methods employ fusion 
proteins of the DSBI enzyme and the ligand binding domain for 
steroid hormone receptor (for example the human androgen 
receptor, or mutated variants of the human estrogen receptor as 
10 described therein). Induction may be effected with ligands such 
as, for example, estradiol, dexamethasone, 4 -hydroxy tamoxifen or 
raloxifen. 

Some DBSI enyzmes are active as dimers (homo- or heterodimers ; 

15 I-Crel forms a homodimer; I-SecIV forms a heterodimerk) (Wernette 
CM (1998) Biochemical & Biophysical Research Communications 
248( 1) :127-333 ) ) . Dimerization can be designed as an inducible 
feature, for example by exchanging the natural dimerization 
domaines for the binding domaine of a low-molecular -weight 

20 ligand. Addition of a dimeric ligand then brings about 

dimerization of the fusion protein. Corresponding inducible 
dimerization methods, and the preparation of the dimeric ligands, 
have been described (Amara JF et al. (1997) Proc Natl Acad Sci 
USA 94(20): 10618-1623; Muthuswamy SK et al. (1999) Mol Cell Biol 

25 19(10) .-6845-685; Schultz LW and Clardy J (1998) Bioorg Med Chem 
Lett. 8(1): 1-6; Keenan T et al. (1998) Bioorg Med Chem. 
6(8) :1309-1335) . 

"Recognition sequence for the site-directed induction of DNA 
30 double-strand breaks" generally refers to those sequences which, 
under the conditions in the eukaryotic cell or organism used in 
each case, enable the recognition and cleavage by the DSBI 
enzyme. The recognition sequences for the respective DSBI enzymes 
detailed are mentioned in Table 1 hereinbelow by way of example, 
35 but not by limitation. 

- Table 1: Recognition sequences and organisms of origin of DSBI 
enyzmes ("*" indicates the cleavage site of the DSBI enzyme 
within a recognition sequence). 

40 



45 
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10 



20 



30 



35 



40 



45 



DSBI 
enzyme 


Organism 
of origin 


Recognition sequence 


CRE 


Bacteriophage 
PI 


5 ' -AACTCTCATCGCTTCGGATAACTTCCTGTTATCCGAAACAT 
ATCACTCACTTTGGTGATTTCACCGTAACTGTCTATGATTAATG 
-3' 


FLP 


Saccharomyces 
cerevisiae 


D — (3AAC3 xTCCTAT xCCGAAGTTC-CTAT a C.TC.TAGAAAvjTAT AG 

GAACTTC-3 ' 


R 


pSRl 

Plasmids 


5 ' -CGAGATCATATCACTGTGGACGTTGATGAAAGAATACGTTA 
TTCTTTCATCAAATCGT 


P-Element 
Trans - 
posase 


Drosophila 


5 ' -CTAGATGAAATAACATAAGGTGG 


I-Anil | 


Aspergillus 
niauxcins 


5 ' -TTGAGGAGGTT * TCTCTGTAAATAANNNNNNNNNNNNNNN 


I-Ddil 


Dictyostelium 

QlsCUXUcufilnAJ 


5 ' -TTTTTTGGTCATCCAGAAGTATAT 


I-Cvul 


Chlorella 


5 ' -CTGGGTTCAAAACGTCGTGA ~ GACAGTTTGG 

7 ' _ fZAPPPA ACTTTTflP Af2 A P TxHWCmWH A IV Af*p 
J "VjnLL^-ftnu X X X X Vjv-AVj LAL lLlol LAAnUL 


I-CsmI 


Chlamydomonas 


5 ' -GTACTAGCATGGGGTCAAATGTCTTTCTGG 


I-Cmoel 


Chlamydomonas 

iUVJ CWUOll 


5 ' -TCGTAGCAGCT * C ACGGTT 


I-Crel 


Chlamydomonas 
reinhardtii 


5 ' -CTGGGTTCAAAACGTCGTGA* GACAGTTTGG 
3 ' -GACCCAAGTTTTGCAG* CACTCTGTCAAACC, 


I-Chul 


Chlamydomonas 
humicola 


5 ' - G AAGGTTTGGC ACC T CG A ATGTCGGCTC ATC 
3 ' -CTTCCAAACCGTG * GAGCTAC AGCCGAGTAG 


I-Cpal 


Chlamydomonas 
pallidostigma 
tica 


5 ' -CGATCCTAAGGTAGCGAA" ATTCA 
3 ' -GCTAGGATTCC ATC * GCTTTAAGT 


I-CpaII 


Chlamydomonas 
pallidostigma 
tica 


5 ' -CCCGGCTAACTC A TGTGCCAG 
3 ' -GGGCCGAT A TGAGACACGGTC 


I-Ceul 


Chlamydomonas 
eugametos 


5 ' -CGTAACTATAACGGTCCTAA* GGTAGCGAA 
3 ' -GCATTGATATTGCCAG A GATTCCATCGCTT 


I-Dmol 


Desulfuro- 

coccus 

mobilis 


5 ' -ATGCCTTGCCGGGTAA A GTTCCGGCGCGC AT 
3 ' -TACGGAACGGCC^CATTCAAGGCCGCGCGTA 


I-Scel 


S. cerevisiae 


5 ' - AGT T ACGCT AGGGAT AA * CAGGGTAATATAG 
3 ' -TCAATGCGATCCCTATTGTCCCATTATATC 
5 ' - T AGGGAT AA A CAGGGTAAT 

^ ' -ZVrrrC^TATTGTCCCATTA ( " Core S ecm pnrp \ 


I-Scell 


S. cerevisiae 


5 ' -TTTTGATTC TTTGGT C ACCC A TGAAGTATA 
# — a a a ar'pA a<za AArcAG A tggczacttcatat 


I-SceIII 


S. cerevisiae 


5 ' -ATTGGAGGTTTTGGTAAC A TATTTATTACC 

7 ' — TAAPPTPCA AAAfC* ATTfZATAAATAATGft 


I-SceIV 


S. cerevisiae 


5 ' -TCTTTTCTCTTGATTA~GCCCTAATCTACG 
3 ' -AflAAAAfiArSAAr^TAATnGGCATTAGATGP 


I-SceV 


S. cerevisiae 


5 ' -AATAATTTTCT A TCTTAGTAATGCC 
3 ' -TTATTAAAAGAAGAATC ATTA A CGG 


I -See VI 


S. cerevisiae 


5 ' -GTTATTTAATG* TTTTAGTAGTTGG 
3 ' -CAATAAATTACAAAATCATCA* ACC 
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DSBI 
enzyme 


Organism 
of origin 


Recognition sequence 




I-SceVII 


S • cerevisiae 


5 ' -TGTCACATTGAGGTGCACTAGTTATTAC 


5 


Pl-Scel 


S. cerevisiae 


5 ' -ATCTATGTCGGGTGC'GGAGAAAGAGGTAAT 
3 ' -TAGATACAGCCCACGCCTCTTTCTGCATTA 




F-Scel 


S. cerevisiae 


5 -GATGCTGTAGGC A ATAGGCTTGGTT 
3 ' -CTACGACA" TCCGTATCCGAACCAA 




F-Scell 


S • cerevisiae 


5 ' -CTTTCCGCAACA A GTAAAATT 
3 ' -GAAAGGCG A TTGTCATTTTAA 


10 


I-Hmul 


Bacillus 
subtilis 
bacteriophage 
SPOl 


5 ' -AGTAATGAGCCTAACGCTCAGCAA 
3 ' -TCATTACTCGGATTGC "GAGTCGTT 


15 


I-HmuII 


Bacillus 
subtilis 
bacteriophage 


5 ' — AGTAATGAGCCTAACGCTCAACA ANNT^^ 
NNNNNNNNNNNNNNNNNNNNNN 




I-Llal 


Lactococcus 

1 art 1 *5 c 


5 '-CACATCCATAAC^CATATCATTTTT 
7 ' •GVGTAGftTATTPfiTA'pAfiTAiv * aaa 




I-Msol 


Monomastix 

cnpr pc 


5 ' -CTGGGTTCAAAACGTCGTGA^GACAGTTTGG 
^ ' — GAPPPAACTTTTf2PAP*PAPTPnv , TV , *& A*pr» 


20 


I-Nanl 


Naegleria 


5 ' - AAGTCTGGTGCC A * GCACCCGC 
^ ' — ttpagapp* APGfiTPcppratpr^ 




I-NitI 


Naegleria 
italica 


5 ' - AAGTCTGGTGCC A ~ GCACCCGC 
3 f -TTCAGACC A ACGGTCGTGGGCG 


&o 


I-Njal 


Naegleria 
jamiesbni 


5 ' -AAGTCTGGTGCCA A GCACCCGC 
3 ' -TTCAGACC A ACGGTCGTGGGCG 


I-PakI 


Pseuden- 
doc Ionium 

ak i tip turn 

CLXWX. A 1 V* t» Will 


5 ' -CTGGGTTCAAAACGTCGTGA A GACAGTTTGG 
3/ -GACCCAAGTTTTGCAG^CACTCTGTCAAACC 




I-Porl 


Pyrobaculum 

or y ctxit/ uiopu tun 


5 r -GCGAGCCCGTAAGGGT "GTGTACGGG 

O ^UoUIUItooUaI i. vLUALAtAlv7Lt.C 


30 


I-Ppol 


Physarum 


5 ' -TAACTATGACTCTCTTAA~ GGTAGCCAAAT 

O * A 1 1 un 1 Au 1 0 Au AVj AA1 lULAluwl 1 1 A 




I-Scal 


Saccharomyces 


5 ' ~TGTCACATTGAGGTGCACT~AGTTATTAC 
3 ' -APAGTRTAAPTPPAP^GTf^ATPAATAATYt 

•J ~A*— AVJA W1AAV. l^UttL UlVjAl^AAlAAlVj 




I-Ssp6803I 


Synechocystis 
SDecies 


5 ' -GTCGGGCT * CATAACCCGAA 
3 ' — CAGCPPGAGTA^TTGGGPTT 


35 


Pl-Pful 


Pyrococcus 
furiosus Vcl 


5 ' - GAAGATGGGAGGAGGG A ACCGGACTCAACTT 
3 ' -CTTCTACCCTCC A TCCCTGGCCTGAGTTGAA 




Pl-PfuII 


Pyrococcus 
furiosus Vcl 


5 ' -ACGAATCCATGTGGAGA" AGAGCCTCTATA 
3 ' -TGCTTAGGTACAC A C TCTTCTCGGAGATAT 


40 


Pl-Pkol 


Pyrococcus 

kodakaraensis 

K0D1 


5 ' -GATTTTAGAT A CCCTGTACC 
3 ' -CTAAAA'TCTAGGGACATGG 




Pl-PkoII 


Pyrococcus 

kodakaraensis 

K0D1 


5 ' -CAGTACTACG ~GTTAC 
3 ' -GTCATG^ATGCCAATG 


45 


PI-PspI 


Pyrococcus 
sp. 


5 ' -AAaATCCTGGCAaACAGCTaTTAT^GGGTAT 
3 • -TT TTAGGACCGTTTG TCGAT A AATACCCATA 




PI-Tful 


Thermococcus 

fumicolans 

ST557 


5 ' -tagattttaggt^cgctatatccttcc 
3 '-atctaaaa'tccagcgatataggaagg 
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5 



10 



15 



DSBI 
enzyme 


Organism 
of origin 


Recognition sequence 


PI-TfuII 


Thermococcus 

fumicolans 

ST557 


5 ' -TAYGCNGAYACN "GACGGYTTYT 
3 ' -ATRCGHCT "RTGNCTGCCRAARA - 


PI -Thy I 


Thermococcus 

hydro- 

thermalis 


5 ' -TAYGCNGAYAC1TGACGGYTTYT 
3 ' -ATRCGNCT A RTGNCTGCCRAARA 


PI-Tlil 


Thermococcus 

li toralis 

XX. l»w^u^.X>0 


5 ' -TAYGCHGAYACNGACGG A YTT YT 
3 ' -ATRCGNCTRTGNC^TGCCRAARA 


PI-Tlill 


Thermococcus 


5 ' -AAATTGCTTGCAAACAGCTATTACGGCTAT 


I-TevI 


Bacteriophage 

T4 


5 ' -AGTGGTATCAAC "GCTCAGTAGATG 
3 ' — TCACCATAGT^TC^GAGTCATCTAC 1 


I-TevII 


Bacteriophage 
T4 


5 r -GCTTATGAGTATGAAGTGAACACGT" TATTC 
3 ' -CGAATACTCATACTTCACTTGTG A CAATAAG 


F-TevI 


Bacteriophage 
T4 


5 ' - G AAAC AC AAGA " AATGTTTAGTAAANNNNNNNNNNNNNN 
3 ' -CTTTGTGTTCOTTACAAATCATTTNWNNNNNNNNN^ 


F-TevII 


Bacteriophage 
T4 


5 ' -TTTAATCCTCGCTTC ~ AGATATGGCAACTG 
3 ' -AAATTAGGAGCGA* AGTC TATACC GTTGAC 



20 Also encompassed are minor deviations (degenerations) of the 

recognition sequence which still enable recognition and cleavage 
by the DSBI enzyme in question i Such deviations - also in 
connection with different framework conditions such as, for 
example, calcium or magnesium concentration — have been described 

25 (Argast GM et al. (1998) J Mol Biol 280: 345-353). Also 

encompassed are core sequences of these recognition sequences. It 
is known that the inner portions of the recognition sequences 
suffice for an induced double-strand break and that the outer 
ones are not absolutely relevant, but can codetermine the 

30 cleavage efficacy. Thus, for example, an 18bp core sequence can 
be defined for I-Scel. 

Recombination construct and DSBI enzyme can be combined to give 
one of the recombination systems or methods according to the 
35 invention in various ways with which the skilled worker is 

familiar. Thus, the recombination constructs and the DSBI enzyme 
can be combined in an organism, a cell, a cell compartment or a 
tissue for example as follows : 

40 1-) Organisms which have the recombination cassette inserted into 
the chromosomal DNA are generated in the customary manner. 
For example, such plants can be generated preferably by 
agrobacteria-mediated transformation. The primary 
trans formants which contain the recombination cassette are 

45 employed for the transformation with an expression cassette 

which ensures expression of the DSBI enzyme or grown in a 
suitable manner until homozygous, when they act as the host 
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organism (for example host plant) for the transformation with 
an expression cassette which ensures expression of the DSBI 
enzyme. Starting from these host plants, it is possible, for 
example, to initiate, establish and use for transformation 
5 purposes in -vitro cultures such as, for example, callus 

cultures or embryogenic cultures. Transformation with the 
expression cassette for the DSBI enzyme can be in each case 
stable or transient. 

10 2.) Organisms known as master organisms, which bear and express 
the corresponding gene for the DSBI enzyme (or an expression 
cassette which ensures the expression of the DSBI enzyme) are 
generated in the customary manner. For example, such master 
plants can be generated preferably by agrobacterium-mediated 

15 transformation. The primary transf ormants which express the 

DSBI enzyme are employed for the transformation with the 
recombination construct or grown in a suitable manner until 
homozygous, which is when they act as master organism or host 
organism (for example master plant) into which the 

20 recombination constructs are introduced. Starting from these 
master plants, it is possible, for example, to initiate, 
establish and use for transformation purposes in-vitro 
cultures such as, for example, callus cultures or embryogenic 
cultures . 

25 

3. ) The gene encoding the DSBI enzyme (or an expression cassette 

which ensures the expression of the DSBI enzyme) can be 
integrated into a vector which already bears the 
recombination cassette and thus introduced into plant cells ' 

30 simultaneously with the target gene. It is preferred to 

insert the gene encoding the DSBI enzyme between the homology 
sequences and thus to delete it from the chromosomal DNA 
after it has fulfilled its function. Very especially 
preferably, expression of the DSBI enzyme is inducible in 

35 such a case (for example under the control of one of the 

inducible promotors described hereinbelbw) , in a 
development -dependent fashion using a development -dependent 
promotor, or else DSBI enzymes are employed whose activity is 
inducible in order to avoid cleaving the recombination 

40 construct immediately after the transformation and prior to 
its insertion into the genome. 

4. ) Relying on the co- trans format ion technique, the expression 

cassette which ensures the expression of the DSBI enzyme can 
45 be transformed into the cells simultaneously with the 

recombination construct, but on a separate vector. 
Co-transformation can be in each case stable or transient. In 
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such a case, expression of the DSBI enzyme is preferably 
inducible (for example under the control of one of the 
inducible promotors described hereinbelow) , in a 
development-dependent fashion using a development-dependent 
5 promotor, or else DSBI enzymes are employed whose activity is 
inducible in order to avoid cleaving the recombination 
construct immediately after the transformation and prior to 
its insertion into the genome. 

10 5.) Organisms > for example plants or else animals, expressing the 
DSBI enzyme may also act as parent individuals. In the 
progeny from the hybridization between organisms expressing 
the DSBI enzyme on the one hand and organisms bearing the 
recombination construct on the other hand, the desired 

15 double-strand breaks and recombination between the homology 
sequences are observed, with the possible deletion of the 
sequences located between the homology sequences. 

6. ) Expression of the DSBI enzyme is also conceivable in a 

20 transient transformation approach in which the possibilities 
2 to 4 can be exploited. 

7. ) The DSBI enzyme can also be introduced into cells comprising 

or bearing the transgenic recombination construct directly, 
25 for example via microinjection, particle bombardment 

(biolistic method), polyethylene glycol transfection or 
liposome -mediated transfection. This embodiment is 
advantageous since no DSBI-enzyme-encoding sequences can 
remain in the genome. Such a method has been described for 
30 example by Segal DJ et al. (1995) Proc Natl Acad Sci USA 
92:806-810. 

8. ) The DSBI enzyme may also be generated by introducing the 

DSBI-enzyme-encoding, in-vitro-generated mRNA into cells (for 
35 example via microinjection, particle bombardment (biolistic 
method) or liposome-mediated transfection). This embodiment 
is advantageous since no DSBI-enzyme-encoding sequences can 
remain in the genome. 

40 9.) The DSBI enzyme can be introduced into plant cells as a 
fusion protein with the VirE2 or VirF protein of an 
agrobacterium. Such methods have been described for example 
for Cre recombinase (Vergunst AC et al. (2000) Science. 290: 
979-982). If the expression cassette for the fusion protein 

45 is located outside the border sequences, it is not inserted 
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into the plant genome. This embodiment is advantageous since no 
DSBI-enzyme-encoding sequences can remain in the genome. 

The recombination system or method according to the invention can 
5 be realized in intact organisms or else in parts, cells or 

propagation material derived therefrom, especially preferably in 
intact plants or else in any plant tissue or plant in-vitro 
cultures including callus. An in-vitro application using, for 
example, wheat germ extract or reticulocyte extract can also be 
10 envisaged. 

As described above, the DSBI enzyme can be generated using an 
expression cassette which comprises the DNA encoding a DSBI 
enzyme and is introduced into a eukaryotic cell or organism. In 

15 this context, the expression cassette for the DSBI enzyme 
preferably comprises a nucleic acid sequence encoding a DSBI 
enzyme as shown in SEQ ID NO: 2, 4, 6, 8 or 10 or a functional 
equivalent of same which is capable of generating DNA 
double-strand breaks in double-stranded DNA using the essentially 

20 identical recognition sequence. Essentially identical recognition 
sequences refer to those recognition sequences which, while 
deviating from the recognition sequence identified as optimal for 
the enzyme in question, still permit cleavage by this enzyme. 
Very especially preferably, the expression cassettes for the DSBI 

25 enzyme comprise a nucleic acid sequence as shown in SEQ ID NO: 1, 
3, 5, 7 or 9. 

Expression cassette — for example when referring to the 
expression cassette for the DSBI enzyme — means those 

30 constructions in which the DNA to be expressed is linked operably 
to at least one genetic control element that [sic] enables or 
regulates its expression (i.e. transcription and [lacuna] or 
translation). Here, expression may be for example stable or 
transient, constitutive or inducible. For introducing it, the 

35 skilled worker may resort to various direct methods (for example 
trans fection, particle bombardment, microinjection) or indirect 
methods (for example infection with agrobacteria, infection with 
viruses), all of which are detailed further below. 

40 Operable linkage is generally understood as meaning an 

arrangement in which a genetic control sequence is capable of 
exerting its function with regard to a nucleic acid sequence, for 
example while encoding a DSBI enzyme. Function, in this context, 
may mean for example control of the expression, i.e. 

45 transcription and/or translation, of the nucleic acid sequence, 
for example one encoding a DSBI enzyme. Control, in this context , 
encompasses for example initiating, increasing, governing or 
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suppressing the expression, i.e. transcription and, if 
appropriate, translation. Controlling, in turn, may be, for 
example, tissue- and [lacuna] or time-specific. It may also be 
inducible, for example by certain chemicals, stress, pathogens 
5 and the like. 

Operable linkage is understood as meaning for example the 
sequential arrangement of a promotor, of the nucleic acid 
sequence to be expressed - for example one encoding a DSBI enzyme 
10 - and, if appropriate, further regulatory elements such as, for 
example, a terminator, in such a way that each of the regulatory 
elements can fulfil its function when the nucleic acid sequence — 
for example one encoding a DSBI enzyme — is expressed. 

15 This does not necessarily require a direct linkage in the 

chemical sense. Genetic control sequences such as, for example, 
enhancer sequences are also capable of exerting their function on 
the target sequence from positions located at a distance or 
indeed other DNA molecules. Preferred arrangements are those in 

20 which the nucleic acid sequence to be expressed - for example one 
encoding a DSBI enzyme - is positioned after a sequence acting as 
promotor so that the two sequences are linked covalently to one 
another. The distance between the promotor sequence and the 
nucleic acid sequence — for example one encoding a DSBI enzyme — 

25 is preferably less than 200 base pairs, especially preferably 

less than 100 base pairs, very especially preferably less than. 50 
base pairs. 

The skilled worker is familiar with a variety of ways in order to 
30 obtain such an expression cassette. For example, it is preferably 
prepared by directly fusing a nucleic acid sequence which acts as 
promotor with a nucleotide sequence to be expressed - for example 
one encoding a DSBI enzyme. Operable linkage can be achieved by 
means of customary recombination and cloning techniques as are 
35 described, for example, in T. Maniatis, E.F. Fritsch and J. 
Sambrook, Molecular Cloning: A Laboratory Manual, Cold 
Spring Harbor Laboratory, Cold Spring Harbor, NY (1989), in T.J. 
Silhavy, M.L. Berman and L.W. Enquist, Experiments with Gene 
Fusions, Cold Spring Harbor Laboratory, Cold Spring Harbor, NY 
40 (1984) and in Ausubel, F.M. et.al., Current Protocols in 
Molecular Biology, Greene Publishing Assoc. and Wiley 
Interscience (1987). 

However, an expression cassette may also be constructed in such a 
45 way that the nucleic acid sequence to be expressed (for example 
one encoding a DSBI enzyme) is brought under the control of an 
endogenous genetic control element, for example a promotor, for 
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example by means of homologous recombination or else by random 
insertion. Such constructs are likewise understood as being 
expression cassettes for the purposes of the invention. 

5 The skilled worker furthermore knows that nucleic acid molecules 
may also be expressed using artificial transcription factors of 
the zinc finger protein type (Beerli RR et al. (2000) Proc Natl 
Acad Sci USA 97(4) : 1495-500) . These factors can be adapted to 
suit any sequence region and enable expression independently of 
10 certain promotor sequences. 

The term "genetic control sequences" is to be understood in the 
broad sense and refers to all those sequences which affect the 
coming into existence, or the function, of the expression 

15 cassette according to the invention. For example, genetic control 
sequences ensure transcription and, if appropriate, translation 
in prokaryotic or eukaryotic organisms. Preferably, the 
expression cassettes according to the invention encompass 
5 '-upstream of the respective nucleic acid sequence "to be 

20 expressed a promotor and 3 '-downstream a terminator sequence as 
additional genetic control sequence, and, if appropriate, further 
customary regulatory elements, in each case in operable linkage 
with the nucleic acid sequence to be expressed. 

25 Genetic control sequences are described, for example, in 

"Goeddel; Gene Expression Technology: Methods in Enzymology 185, 
Academic Press, San Diego, CA (1990)" or "Gruber and Crosby, in: 
Methods in Plant Molecular Biology and Biotechnolgy, CRC Press, 
Boca Raton, Florida, eds.:Glick and Thompson, Chapter 7, 89-108" 

30 and the references cited therein. 

Examples of such control sequences are sequences to which 
inductors or repressors bind and thus regulate the expression of 
the nucleic acid. The natural regulation of the sequences before 

35 the actual structural genes may still be present in addition to 
these novel control sequences or instead of these sequences and, 
if appropriate, may have been genetically modified in such a way 
that the natural regulation has been switched off and gene 
expression increased. However, the expression cassette may also 

40 be simpler in construction, that is to say no additional 

regulatory signals are inserted before the abovementioned genes, 
and the natural promotor together with its regulation is not 
removed. Instead, the natural control sequence is mutated in such 
a way that regulation no longer takes place and gene expression 

45 is increased. These modified promotors may also be placed on 
their own before the natural genes for increasing the activity. 
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A variety of control sequences are suitable, depending on the 
host organism or starting organism described in greater detail 
hereinbelow, which, owing to the introduction of the expression 
cassettes or vectors, becomes a genetically modified, or 
5 transgenic, organism. 

Advantageous control sequences for the expression cassettes or 
vectors according to the invention are present for example in 
promoters such as cos, tac, trp, tet, phoA, tat, lpp, lac, laclq, 
10 T7, T5, T3, gal, trc, ara, SP6, X-PR or in the X-PL promotor, 
which are advantageously used in Gram-negative bacteria. 

Further advantageous control sequences are present for example in 
the Gram-positive promotor s amy and SP02, in the yeast or fungal 
15 promotors ADC 1, MFa , AC, P-60, CYC1, GAPDH, TEF, rp28, ADH or in 
the plant promotors CaMV/35S (Franck et al. (1980) Cell 
21:285-294), PRP1 (Martini N et al. (1993) Mol Gen Genet. 
236(2-3) :179-186), SSU, OCS, LEB4, USP, STLS1, B33, NOS; FBPaseP 
(WO 98/18940) or in the ubiquitin or phaseolin promotors. 

20 

Vectors which are suitable for expression in vertebrates, 
preferably in mammals, are vectors like the TK promotor, the RSV 
3' LTR promotor, the CMV promotor or the SV40 early or late 
promotor. The skilled worker is familiar with other promotors. 

25 Inducible promotors suitable for use in vertebrates, preferably 
in mammals, encompass for example the Tet promotor /repre si sor, 
which is inducible or repressible by tetracylin or derivatives, 
the dexamethasone-inducible MMTV-LTR promotor, the Drosophila 
minimal heat shock promotor, which is inducible by ecdysone or 

30 the analog ponasterone A (for example within the pVgRXR 
expression system; Invitrogen, Inc.). 

A preferred promotor is, in principle, any promotor which is 
capable of controlling the expression of genes, in particular 

35 foreign genes, in plants. Preferred promotors are those which 
enable constitutive expression in plants (Benfey et al. (1989) 
EMBO J. 8:2195-2202). A promotor which is preferably used is, in 
particular, a plant promotor or a promotor derived from a plant 
virus. Especially preferred is the promotor of the cauliflower 

40 mosaic virus 35S transcript (Franck et al. (1980) Cell 

21:285-294; Odell et al.(1985) Nature 313 -.810-812; Shewmaker et 
al. (1985) Virology 140:281-288; Gardner et al. 1986, Plant Mol. 
Biol. 6, 221-228) or the 19S CaMV promotor (US 5,352,605 and 
WO 84/02913). It is known that this promotor comprises a variety 

45 of recognition sequences for transcriptional effectors which, in 
their totality, bring about permanent and constitutive expression 
of the gene introduced (Benfey et al. (1989) EMBO J 8:2195-2202). 
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A further suitable constitutive promotor is the Rubisco small 
subunit (SSU) promotor (US 4,962,028). A further example of a 
suitable promotor is the leguminB promotor (GenBank Acc.-No.: 
X03677). Further preferred constitutive promotors are, for 
5 example, the Agrobacterium nbpaline synthase promotor, the TR 
dual promotor, the agrobacterium OCS (octopine synthase) 
promotor, the ubiquitin promotor (Holtorf S et al. (1995) Plant 
Mol Biol 29:637-649), the promotors of the vacuolar ATPase 
subunits, or the promotor of a wheat prolin-rich protein (WO 
10 91/13991). 

The Expression cassettes may also comprise an inducible, 
preferably chemically inducible, promotor (Aoyama T and Chua NH 
(1997) Plant J 11:605-612; Caddick MX et al. (1998) Nat. 

15 Biotechnol 16:177-180; Rewiew: Gatz, Annu Rev Plant Physiol Plant 
Mol Biol 1997, 48:89-108), by means of which the expression of 
the exogenous gene in the plant can be controlled at a specific 
point in time. Such promotors, such as, for example, the PRP1 
promotor (Ward et al., Plant. Mol. Biol- 22 (1993), 361-366), a 

20 salicylic-acid-inducible promotor (WO 95/19443), a 

benzenesulfonamide-inducible promotor (EP-A-0388186) , a 
tetracyclin-inducible promotor. (Gatz et al., (1992) Plant J, 2, 
397-404), an abscisic acid-indue ible promotor (EP-A 335528), a 
salicylic acid- inducible promotor (WO 95/19443) or an ethanol- 

25 (Salter MG et al, (1998) Plant J. 16:127-132) or 

cyclohexanone-inducible (WO 93/21334) promotor may likewise be 
used. 

In an especially preferred embodiment, the DSBl-enzyme-encoding 
30 nucleic acid, in particular, is expressed under the control of an 
inducible promotor. This leads to a controlled, governable 
expression and deletion - for example in plants -, and any 
problems caused by a constitutive expression of a DSBI enzyme are 
avoided. 

35 

Other preferred promotors are promotors induced by biotic or 
abiotic stress, such as, for example, the pathogen-inducible 
promotor of the PRPl gene (Ward et al. , Plant Mol Biol 1993, 22: 
361-366), the tomato heat-inducible hsp80 promotor (US 
40 5,187,267), the potato chill-inducible alpha-amylase promotor (WO 
96/12814) or the wound-induced pinll promotor (EP375091). 

Other preferred promotors are promotors with specificity for the 
anthers, ovaries, pollen, the meristem, flowers, leaves, stems, 
45 roots and seeds. 
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A development-regulated promotor is, inter alia, described by 
Baerson et al. (Baerson SR, Lamppa GK (1993) Plant Mol Biol 
22(2):255-67). 

5 Especially preferred promotors are those which ensure expression 
in tissues or plant parts in which the biosynthesis of starch 
and/or oils or their precursors takes place or in which the 
products are advantageously accumulated. The biosynthesis site of 
starch are the chloroplasts of the leaves or the amyloplasts of 

10 the storage organs such as seeds, fruits or tubers . Within these 
organs, it is predominantly the cells of the endosperm or the 
cotyledons of the embryo in which synthesis takes place. 
Preferred promotors are thus in addition to the abovementioned 
constitutive promotors in particular seed-specific promotors such 

15 as, for example, the phaseolin promotor (US 5,504,200, Bustos MM 
et al., Plant Cell. 1989; 1(9) : 839-53) , the promotor of the 2S 
albumin gene ( Josef f son LG et al.(1987) J Biol Chem 262: 
12196-12201), the legumin promotor (Shirsat A et al. (1989) Mol 
Gen Genet. 215(2) :326-331) , the USP (unknown seed protein) 

20 promotor (Baumlein H et al. (1991) Molecular & General Genetics 
225(3) :459-67), the napin gene promotor (US 5,608,152; Stalberg 
K, et al. (1996) L. Planta 199: 515-519), the sucrose binding 
protein promotor (WO 00/26388) or the legumin B4 promotor (LeB4 ; 
Baumlein H et al. (1991) Mol Gen Genet 225:121-128; Baeumlein et 

25 al. (1992) Plant Journal 2(2) : 233-239; Fiedler U et al. (1995) 
Biotechnology (NY) 13. ( 10) :1090-1093) , the Ins Arabidopsis oleosin 
promotor (W09845461 ) , the Brassica Bee 4 promotor (WO 91/13980). 
Further suitable seed-specific promotors are those of the genes 
encoding the ''high-molecular -weight glutenin" (HMWG), gliadin, 

30 branching enzyme, ADP-glucose pyrophosphatase (AGPase) or starch 
synthase. Furthermore preferred promotors are those which enable 
seed-specific expression in monocots such as maize, barley, 
wheat, rye, rice and the like . Promotors which may advantageously 
be employed are the promotor of the lpt2 or lptl gene 

35 (WO 95/15389, WO 95/23230) or the promotors described in 

WO 99/16890 (promotors of the hordein gene, the glutelin gene, 
the oryzin gene, the prolamine gene, the gliadin gene, the 
glutelin gene, the zein gene, the kasirin gene or the secalin 
gene ) . 

40 

Promotors which are preferred as genetic control elements are, 
furthermore, pollen-specific promotors such as, for example, the 
promotor of the B. campestris bgpl gene (GenBank Acc.-No: X68210; 
Xu H et al. (1993) Mol Gen Genet 239 ( 1-2 ): 58-65; WO 94/13809), of 
45 the Oryza sativa ory s 1 gene (GenBank Acc.-No.: AJ012760; Xu H 
et al. (1995) Gene 164 (2 ) : 255-259) , of the pollen-specific maize 
gene ZMl 3 (Hamilton DA et al. (1998) Plant Mol Biol 
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38(4) :663-669; US 5,086,169)., and of the B.napus gene BplO 
(GenBank Acc.-No. : . X64257 ; Albani D (1992) Plant J 2(3) :331-342; 
US 6,013,859). 

5 Other preferred promotors are the Lcgl promotor for cell-specific 
expression in the male gametes (WO 99/05281; XU H et al. (1999) 
Proc. Natl. Acad. Sci. USA Vol. 96:2554-2558) and the promotor of 
the AtDMCl gene (Klimyuk VI et al.(1997) Plant J. 11(1):1-14). 

IQ Further suitable promotors are, for example, specific promotors 
for tubers> storage roots or roots such as, for example, the 
class I patatin promotor (B33), the potato cathepsin D inhibitor 
promotor, the starch synthase (GBSS1) promotor or the sporamin 
promotor, and fruit-specific promotors such as, for example, the 

25 tomato fruit-specific promotor (EP-A 409625) . 

Promotors which are furthermore suitable are those which ensure 
leaf -specific expression. Promotors which may be mentioned are 
the potato cytosolic FBPase promotor (WO 98/18940) , the Rubisco 
20 (ribulose-l,5-bisphosphate carboxylase) SSU (small subunit) 

promotor or the potato ST-LSI promotor (Stockhaus et al. (1989) 
EMBO J 8(9) :2445-2451) . Other preferred promotors are those which 
govern expression in seeds and plant embryos. 

25 Further suitable promotors are, for example, 

fruit-maturation-specific promotors such as, for example, the 
tomato fruit -maturation-specific promotor (WO 94/21794), 
flower-specific promotors such as, for example, the phytoene 
synthase promotor (WO 92/16635) or the promotor of the P-rr gene 

30 (WO 98/22593) or another node-specific promotor as described in 
EP-A 249676 may be used advantageously [sic]. 

In principle, all natural promotors together with their 
regulatory sequences, such as those mentioned above, may be used 
35 for the method according to the invention. In addition, synthetic 
promotors may also be used advantageously. 

Genetic control sequences also encompass further promotors, 
promotor elements or minimal promotors capable of modifying the 

40 expression-specific characteristics. Thus, for example, the 
tissue-specific expression may take place in addition as a 
function of certain stress factors, owing to genetic control 
sequences. Such elements are, for example, described for water 
stress, abscisic acid (Lam E and Chua NH (1991) J Biol Chem 

45 266(26) :17131 -17135) and heat stress (Schoffl F et al. (1989) 
Molecular & General Genetics 217(2-3) : 246-53 ) . 
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Furthermore, other promotors which enable expression in further 
plant tissues or other organisms, such as, for example, E.coli 
bacteria, may be linked oper ably with the nucleic acid sequence 
to be expressed. Plant promotors which are suitable are, in 
5 principle, all of the above-described promotors. 

Genetic control sequences furthermore also encompass the the 
5' -untranslated region, introns or the noncoding 3 '-region of 
genes. It has been demonstrated that they may play a significant 

10 role in the regulation of gene expression. Thus, it has been 
demonstrated that 5 '-untranslated sequences are capable of 
enhancing the transient expression of heterologous genes. 
Furthermore, they may promote tissue specificity (Rouster J et 
al., Plant J. 1998/ 15: 435-440.). Conversely, the 

15 5' -untranslated region of the opaque-2 gene suppresses 

expression. Deletion of the region in question leads to an 
increased gene activity (Lohmer S et al., Plant Cell 1993, 
5:65-73). 

20 Genetic control sequences may also encompass ribosome binding 
sequences for initiating translation. This is preferred in 
particular when the nucleic acid sequence to be expressed does 
not provide suitable sequences or when they are not compatible 
with the expression system. 

25 

The expression cassette can advantageously comprise one or more 
of what are known as enhancer sequences in operable linkage with 
the promotor, which enable the increased transgenic expression of 
the nucleic acid sequence. Additional advantageous sequences, 
30 such as further regulatory elements or terminators, may also be 
inserted at the 3' end of the nucleic acid sequences to be 
expressed recombinant ly. One or more copies of the nucleic acid 
sequences to be expressed recombinantly may be present in the 
gene construct. 

35 

Genetic control sequences are furthermore understood as meaning 
sequences which encode fusion proteins consisting of a signal 
peptide sequence. 

40 Polyadenylation signals which are suitable as genetic control 
sequences are plant polyadenylation signals, preferably those 
which correspond essentially to T-DNA polyadenylation signals 
from Agrrobacterium tumefaciens, in particular of gene 3 of the 
T-DNA (octopine synthase) of the Ti plasmids pTiACHS (Gielen 

45 et al., EMBO J. 3 (1984), 835 et sec.) or functional equivalents 
thereof. Examples of particularly suitable terminator sequences 
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are the OCS (octopine synthase) terminator and the NOS (nopaline 
synthase) terminator. 

As mentioned above, the recombination constructs according to the 
5 invention may encompass further nucleic acid sequences. Such 
nucleic acid sequences may preferably constitute expression 
cassettes. The following may be mentioned by way of example of 
the DNA sequences to be expressed in the expression constructs, 
but not by way of limitation: 

10 

i) Positive selection markers: 

As a rule, selection markers are required for selecting cells 
which have successfully under gone homologous recombination or 

15 transformation. The selectable marker which has been 

introduced together with the expression construct confers 
resistance to a biocide (for example a herbicide such as 
phosphinothricin, glyphosate or bromoxynil), a metabolism 
inhibitor such as 2~deoxyglucose-6-phosphate (WO 98/45456) or 

20 an antibiotic such as, for example, tetracyclins [sic], 
ampicillin* kanamycin, G 418, neomycin, bleomycin or 
hygromycin to the cells which have successfully undergone 
recombination or transformation. The selection marker permits 
the selection of the transformed cells from untransf ormed 

25 cells (McConnick et al., Plant Cell Reports 5 (1986), 81-84). 

Especially preferred selection markers are those which confer 
resistance to herbicides. Examples of selection markers which 
may be mentioned are : 

30 - DNA sequences which encode phosphinothricin 

acetyltransf erases [sic] (PAT), which acetylates the free 
amino group of the glutamine synthase inhibitor 
phosphinothricin (PPT) and thus brings about 
detoxification of the PPT (de Block et al. 1987, EMBO J. 

35 6, 2513-2518) (also referred to as Bialophos® 

resistence gene (bar)), 

5-enolpyruvylshikimate-3-phosphate synthase genes (EPSP . 
synthase genes), which confer resistence to Glyphosate® 
40 (N-(phosphonomethyl) glycine) , 

the gox gene, which encodes the Glyphosate®-degrading 
enzymes [sic] (Glyphosate oxidpreductase) , 

45 - the deh gene (encoding a dehalogenase which inactivates 

Dalapon®), 
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acetolactate synthases which inactivate sulfonylurea and 
imidazolinone, 

bxn genes which encode Bromoxynil^-degrading nitrilase 
5 enzymes, 

the kanamycin, or G418, resistence gene (NPTII). The 
NPTII gene encodes a neomycin phosphotransferase which 
reduces the inhibitory effect of kanamycin, neomycin, 
10 G418 and paromomycin owing to a phosphorylation reaction, 

the D0G R 1 gene. The D0G*1 gene has been isolated from the 
yeast Saccharomyces cerevisiae (EP 0 807 836). It encodes 
a 2-deoxyglucose-6-phosphate phosphatase which confers 
15 resistence to 2-DOG (Randez-Gil et al. 1995, Yeast 11, 

1233-1240). 

ii) Negative selection markers enable for example the selection 
of organisms with successfully deleted sequences which 

20 encompass the marker gene (Koprek T et al. (1999) The Plant 
Journal 19<6) :719-726) . TK thymidine kinase (TK) and 
diphtheria toxin A fragment (DT-A) , codA gene encoding a 
cytosine deaminase (Gleve AP et al. (1999) Plant Mol Biol. 
40(2) :223-35; Pereat RI et al. (1993) Plant Mol. Biol 23(4): 

25 793-799; Stougaard J; (1993) Plant J 3:755-761), the 

cytochrome P450 gene (Koprek et al. (1999) Plant J. 
16:719-726), genes encoding a haloalkane dehalogenase 
(Naested H (1999) Plant J. 18:571-576), the iaaH gene 
(Sundaresan V et al. (1995) Genes & Development 9:1797-1810) 

30 or the tms2 gene (Fedoroff NV & Smith DL 1993, Plant J 3: 

273- 289). 

iii) Report genes which encode readily quantifiable proteins and 
which, via intrinsic color or enzyme activity, ensure the 

35 assessment of the transformation efficacy or of the location 
or timing of expression. Very especially preferred here are 
genes encoding reporter proteins (see also Schenborn E, 
Groskreutz D. Mol Biotechnol. 1999; 13(l):29-44) such as 

40 - "green fluorescence [sic] protein" (GFP) (Chui WL et al., 
Curr Biol 1996, 6:325-330; Leffel SM et al., 
Biotechniques. 23(5):912-8, 1997; Sheen et al.(1995) 
Plant Journal 8(5) : 777-784; Haseloff et al.(1997) Proc 
Natl Acad Sci USA 94( 6) :2122-2127; Reichel et al.(1996) 

45 Proc Natl Acad Sci USA 93 ( 12 ) :5888-5893 ; Tian et al. 

(1997) Plant Cell Rep 16:267-271; WO 97/41228). 
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Chloramphenicol transferase , 

luciferase (Millar et al., Plant Mol Biol Rep 1992 
10:324-414; Ow et al. (1986) Science, 234:856-859); 
5 permits the detection of bioluminescence, 

(J-galactosidase, encodes an enzyme for which a variety of 
chromogenic substrates are available, 

10 - ^-glucuronidase (GUS) (Jefferson et al., EMBO J. 1987, 6, 
3901-3907) or the uidA gene, which encodes an enzyme for 
a variety of chromogenic substrates, 

R locus gene product: protein which regulates the 
15 production of anthocyanin pigments (red coloration) in 

plant tissue and thus makes possible the direct analysis 
of the promotor activity without the addition of 
additional adjuvants or chromogenic substrates 
(Dellaporta et al., In: Chromosome Structure and 
20 Function: Impact of New Concepts, 18th Stadler Genetics 

Symposium, 11:263-282, 1988), 

J3-lactamase (Sutcliffe (1978) Proc Natl Acad Sci USA 
75:3737-3741), enzyme for a variety of chromogenic 
25 substrates (for example PAD AC, a chromogenic 

cephalosporin) , 

- xylE gene product (Zukowsky et al. (1983) Proc Natl Acad 
Sci USA 80:1101-1105), catechol dioxygenase capable of 
30 converting chromogenic catechols, 

alpha-amylase (Ikuta et al. (1990) Bio/technol. 
8:241-242), 

35 - tyrosinase (Katz et al.(1983) J Gene Microbiol 

129:2703-2714), enzyme which oxidizes tyrosine to give 
DOPA and dopaquinone which subsequently form melanine, 
which is readily detectable, 

40 - aequorin (Prasher et al.(1985) Biochem Biophys Res Commun 
126(3) : 1259-1268) , can be used in the calcium-sensitive 
bioluminescence detection. 

The recombination construct according to the invention and any 
45 vectors derived from them [sic] may comprise further functional 
elements. The term "further functional elements" is to be 
understood in the broad sense. It preferably refers to all those 
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elements which affect the generation, multiplication, function, 
use or value of the recombination system according to the 
invention, recombination construct according to the invention or 
cells or organisms comprising them. The following may be 
5 mentioned by way of example, but not by limitation, of the 
further functional elements: 

Replication origins which ensure replication of the 
expression cassettes or vectors according to the invention 
in, for example, E. coli. Examples which may be mentioned are 
ORI (origin of DNA replication), the pBR322 ori or the PI 5 A 
ori (Sambrook et al.: Molecular Cloning. A Laboratory Manual, 
2 nd ed. Cold Spring Harbor Laboratory Press, Cold Spring 
Harbor, NY, 1989) . 

Multiple cloning regions (MCS) enable and facilitate the 
insertion of one or more nucleic acid sequences. 

vi) Sequences which make possible homologous recombination or 
20 insertion into the genome of a host organism. 

vii) Elements, for example border sequences, which make possible 
the agrobacterium-mediated transfer in plant cells for the 
transfer and integration into the plant genome, such as, for 

25 example, the right or left border of the T-DNA or the vir 
region. 

All of the abovementioned expression cassettes or further 
functional elements may be located, as mentioned, between the 
30 homology sequences A and B. However, they may also be located 
outside them. This is advantageous in particular in the case of 
border sequences . 

A recombination cassette or expression construct according to the 
35 invention for a DSBI enzyme may advantageously be introduced into 
cells using vectors into which these constructs or cassettes are 
inserted. Examples of vectors may be plasmids, cosmids, phages, 
viruses, retroviruses or else agrobacteria . 

40 Vectors for expression in E.coli are preferably pQE70, pQE60 and 
pQE-9 (QIAGEN, Inc.); pBluescript vectors, >Phagescript vectors, 
pNH8A, pNHl6a, pNH18A, pNH46A (Stratagene Cloning Systems, inc.); 
ptrc99a, pKK223-3, pKK233-3, pDR540, pRlT5 (Pharmacia Biotech, 
Inc. ) . 

45 
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Preferred vectors for eukaryotic expression encompass pWLNEO, 
pSV2CAT, pOG44 f pXTl and pSG (Stratagene Inc.)? pSVK3, pBPV, pMSG 
and pSVL (Pharmacia Biotech, Inc.)- Inducible vectors which may 
be mentioned are pTet-Thia, Potter-Splice, pcDNA4/TO, pcDNA4/TO 
5 /LacZ, pcDNA6/TR, pcDNA4/TO/Myc-His /LacZ, pcDNA4/TO/Myc-His A, 
pcDNA4/T0/Myc-His B, pcDNA4/T0/Myc-His C, pVgRXR (Invitrogen, 
Inc.) or the pMAM series (Clontech, Inc.; GenBank Accession No.: 
U02443). These already provide the inducible regulatory control 
element for example for a chemical inducible expression of a DSBI 
10 enzyme. The nucleic acid sequence encoding a DSBI en2yme may be 
inserted directly into these vectors. 

Vectors for the expression in yeast encompass by way of example 
pYES2, pYDl, pTEFl/Zeo, pYES2/GS, pPICZ # pGAPZ, pGAPZalph, pPIC9, 
15 pPIC3.5, PHIL-D2, PHIL-SI, pPIC3SK, pPIC9K and PA0815 
(Invitrogen, Inc.). 

In an advantageoue embodiment, the expression cassette is 
introduced by means of plasmid vectors. Preferred vectors are 
20 those which enable the stable integration of the expression 
cassette into the host genome. 

Another subject matter of the invention relates to eukaryotic 
transgenic organisms comprising the recombination system 
25 according to the invention, and to cells, cell cultures, tissues, 
parts or propagation material — such as, for example, in the case 
of plant organisms leaves, roots, seeds, fruit, pollen and the 
like — derived from such organisms. 

30 Eukaryotic organism, starting organism or host organism refers to 
higher and lower, single- and multi-celled eukaryotic organisms. 
Also encompassed are eukaryotic microorganisms such as, for 
example, yeasts, algae or fungi. 

35 Preferred yeasts are Candida, Saccharomyces, Hansenula or Pichia, 
with Saccharomyces cerevisiae or Pichia pastoris (ATCC Accession 
No. 201178) being particularly preferred. 

Preferred fungi are Aspergillus, Trichoderma, Ashbya, Neurospora, 
40 Fusarium, Beauveria or other fungi described in Indian Chem Engr. 
Section B. Vol 37, No 1,2 (1995) on page 15, Table 6. The 
filamentous Hemiascomycete Ashbya gossypii is particularly 
preferred. 

45 Host or starting organisms which are preferred in accordance with 
the invention are, furthermore, animal organisms and cells or 
tissue derived from them. Animal organisms encompass preferably 
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vertebrates and invertebrates. Especially preferred vertebrates 
are mammals such as in [sic] dogs, cats, sheep, goats, chickens, 
mice, rats, bovines or horses. Preferred animal cells encompass 
CHO, COS and HEK293 ceils. Preferred invertebrates encompass 
5 insect cells such as Drosophila S2 and Spodoptera Sf9 or Sf21 
cells. 

Host organisms or starting organisms which are preferred as 
transgenic organisms are especially plants. Included within the 

10 scope of the invention are all genera and species of higher and 
lower plants of the plant kingdom. Included are furthermore the 
mature plants, seed, shoots and seedlings, and parts, propagation 
material (for example seeds and fruit) and cultures, for example 
cell cultures, derived therefrom. Mature plants is to be 

15 understood as meaning plants at any developmental stage beyond 
the seedling. Seedling is to be understood as meaning a young, 
immature plant in an early developmental stage. 

The recombination system according to the invention may 
20 preferably be used for the following plant families: 

Amaranthaceae, Brassicaceae, Carophyllaceae, Chenopodiaceae, 
Compositae , Cucurbitaceae , Labiatae , Leguminosae-Papilionoideae, 
Liliaceae, Linaceae, Malvaceae, Rosaceae, Saxifragaceae, 
Scrophulariaceae, Solanacea [sic], Tetragoniacea [sic]. 

25 

Annual, perennial, monocotyledonour and dicotyledonous plants are 
preferred host organisms for the generation of transgenic plants. 
The use of the recombination system, or method according to the 
invention is furthermore advantageous in all ornamental plants, 

30 useful or ornamental trees, flowers, cut flowers, shrubs or turf. 
Plants which may be mentioned by way of example but not by 
limitation are angiosperms, bryophytes such as, for example, 
Hepaticae (hepaticas) and Muse i (mosses); pteridophytes such as 
ferns, horsetail and clubmosses; gymnosperms such as conifers, 

35 cycads, ginkgo and Gnetaeae; algae such as Chlorophyceae, 

Phaeophpyceae [sic] , Rhodophyceae, Myxophyceae, Xanthophyceae , 
Bacillariophyceae (diatoms) and Euglenophyceae . 

Plants for the purposes of the invention comprise by way of 
40 example and not by way of limitation the families of the Rosaceae 
such as rose, Ericaceae such as rhododendrons and azaleas, 
Euphorbiaceae such as poinsettias and croton, Caryophyllaceae 
such as pinks, Solanaceae such as petunias, Gesneriaceae such as 
African violet, Balsaminaceae such as touch-me-not, Orchidaceae 
45 such as orchids, Iridaceae such as gladioli, iris, frees;ia and 
crocus, Compositae such as marigold, Geraniaceae such as 
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geraniums, Liliaceae such as drachaena, Moraceae such as ficus, 
Araceae such as philodendron and many others. 

Flowering plants which may be mentioned by way of example but not 
5 by limitation are the families of the Leguminosae such as pea, 
alfalfa and soya; Gramineae such as rice, maize, wheat; 
Solanaceae such as tobacco and and many others; the family of the 
Umbelliferae, particularly the genus Daucus (very particularly 
the species carota (carrot)) and Apium (very particularly the 

10 species gravedlens dulce (celery)) and many others; the family of 
the Solanacea [sic], particularly the genus Lycopersicon, very 
particularly the species esculentum (tomato) and the genus 
Solanum, very particularly the species tuberosum (potato) and 
melongena (aubergine) and many others; and the genus Capsicum, 

15 very particularly the species annum (peppers) and many others; 
the family of the Leguminosae, particularly the genus Glycine, 
very particularly the species max (soybean) and many others; and 
the family of the Cruciferae, particularly the genus Brassica, 
very particularly the species napus (oilseed rape) , campestris 

20 (beet), oleracea cv Tastie (cabbage), oleracea cv Snowball Y 
(cauliflower) and oleracea cv Emperor (broccoli); and the genus 
Arabidopsis, very particularly the species thaliana and many 
others; the family of the Compositae, particularly the genus 
Lactuca, very particularly the species sativa (lettuce) and many 

25 others. 

The transgenic plants according to the invention are selected in 
particular among monocotyledonous crop plants, such as, for 
example, cereals such as wheat, barley, sorghum and millet, rye, 
30 triticale, maize, rice or oats, and sugar cane. The transgenic 
plants according to the invention are furthermore selected in 
particular from among dicotyledonous crop plants such as, for 
example, 

35 Brassicaceae such oilseed rape, cress, Arabidopsis, cabbages or 
canola, Leguminosae such as soya, alfalfa, peas, beans or peanut 

Solanaceae such as potato, tobacco, tomato, aubergine or peppers, 
Asteraceae such as sunflower, Tagetes, lettuce or Calendula, 

40 

Cucurbitaceae such as melon, pumpkin/ squash or courgette, 

and linseed, cotton, hemp, flax, red pepper, carrot, carrot, 
(sic] sugar beet and the various tree, nut and wine species. 

45 
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Especially preferred are Arabidopsis thaliana, Nicotiana tabaciim 
and oilseed rape and all genera and species which are used as 
food or feedstuffs, such as the above -de scribed cereal species, 
or which are suitable for the production of oils, such as oil 
5 crops (such as, for example, oilseed rape), nut species, soya, 
sunflower, pumpkin/ squash and peanut. 

Plant organisms are furthermore, for the purposes of the 
invention, other organisms which are capable of photosynthetic 
10 activity, such as, for example, algae or cyanobacteria, and also 
mosses. 

Preferred algae are green algae, such as, for example, algae of 
the genus Haematococcus , Phaedactylum tricornatum, Volvox or 
15 Dunaliella. 

The generation of a transformed organism or a transformed cell 
requires introducing the DNA in question into the host cell in 
question. A multiplicity of methods is available for this 

20 procedure, which is termed transformation (see also Keown et al. 
1990 Methods in Enzymology 185:527-537). For example, the DNA can 
be introduced directly by microinjection or by bombardment with 
DNA-coated micr opart icles. Also, the cell can be permeabilized 
chemically, for example using polyethylene glycol, so that the 

25 DNA can enter the cell by diffusion. The DNA can also be 

introduced by protoplast fusion with other DNA-containing units 
such as minicells, cells, lysosomes or liposomes. Another 
suitable method of introducing DNA is electroporation, where the 
cells are permeabilized reversibly by an electrical pulse. 

30 Preferred general methods which may be mentioned are the 
calcium-phosphate-mediated transf ection, the 

DEAE-dextran -mediated trans feet ion, the cationic lipid-mediated 
transf ection, electroporation, transduction and infection. Such 
methods are known to the skilled worker and described, for 
35 example, in Davis et al., Basic Methods In Molecular Biology 
(1986). 

In plants, methods for transforming and regenerating plants from 
plant tissues or plant cells with which the skilled worker is 

40 familiar are exploited for transient or stable transformation. 
Suitable methods are especially protoplast transformation by 
means of polyethylene-glycol-induced DNA uptake, biolistic 
methods such as the gene gun ("particle bombardment" method), 
electroporation, the incubation of dry embryos in DNA-containing 

45 solution, sonication and microinjection, and the transformation 
of intact cells or tissues by micro- or macroinj ection into 
tissues or embryos, tissue electroporation, incubation of dry 
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embryos in DNA-containing solution or vacuum infiltration of 
seeds. In the case, of injection or electroporation of DNA into 
plant cells, the plasmid used need not meet any particular 
requirement. Simple plasmids such as those of the pUC series may 
5 be used. If intact plants are to be regenerated from the 
transformed cells, the presence of an additional selectable 
marker gene on the plasmid is useful. 

Any plant tissue may act as target material. Likewise, expression 
10 may take place in callus, embryogenic tissue or somatic embryos. 

In addition to these "direct" transformation techniques, 
transformation can also be carried out by bacterial infection by 
means of Agrobacterium tumef aciens or Agrobacterium rhizogenes . 
15 These strains contain a plasmid (Ti or Ri plasmid) . Part of this 
plasmid, termed T-DNA (transferred DNA), is transferred to the 
plant following agrobacterial infection and integrated into the 
genome of the plant cell. 

20 The recombination construct or the expression cassette for the 
DSBI enzyme is - preferably integrated into specif ic plasmids, 
either into a shuttle, or intermediate, vector or into a binary 
vector). If, for example, a Ti or Ri plasmid is to be used for 
the transformation, at least the right border, but in most cases 

25 the right and the left border, of the Ti or Ri plasmid T-DNA is 
linked with the expression cassette to be introduced as a 
flanking region. Binary vectors are preferably used. Binary 
vectors are capable of replication both in E. coll and in 
Agrobacterium. As a rule, they contain a selection marker gene 

30 and a linker or polylinker flanked by the right or left T-DNA 
flanking sequence. They can be transformed directly into 
Agrobacterium (Holsters et al.,Mol. Gen. Genet. 163 (1978), 
181-187). The selection marker gene permits the selection of 
transformed agrobacteria and is, for example, the nptll gene, 

35 which imparts resistance to kanamycin. The agrobacterium, which 
acts as host organism in this case, should already contain a 
plasmid with the vir region. The latter is required for 
transferring the T-DNA to the plant cell. An agrobacterium thus . 
transformed can be used for transforming plant cells. 

40 

The use of Agrobacterium tumefaciens for the transformation of 
plants using tissue culture explants has been described by Horsch 
et al. (Horsch RB (1986) Proc Natl Acad Sci USA 83(8) :2571-2575) r 
Fraley et al. (Fraley et al. 1983, Proc. Natl. Acad. Sci. USA 80, 
45 4803-4807) and Bevans et al. (Bevans et al. 1983, Nature 304, 
184-187). Many strains of Agrobacterium tumefaciens are capable 
of transferring genetic material - for example the recombination 
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constructs according to the invention -, such as, for example, 
the strains EHA101[pEHAl01] , EHA105 [pEHA105 J , LBA4404[pAL4404] , 
C58Cl[pMP90] and C58Cl[pGV2260] . The strain EHA101[pEHA101] has 
been described by Hood et al. (Hood EE et al. (1996) J Bacteriol 
5 168(3) :1291-1301) f the strain EHA105[pEHA105 ] by Hood et al. 
(Hood et al. 1993, Transgenic Research 2, 208-218), the strain 
LBA4404[pAL4404] by Hoekema et al. (Hoekema et al. 1983, Nature 
303, 179-181), the strain C58Cl[pMP90] by Koncz and Schell (Koncz 
and Schell 1986 , Mol. Gen. Genet. 204, 383-396), and the strain 
10 C58Cl[pGV2260] by Deblaere et al. (Deblaere et al. 1985, Nucl. 
Acids Res. 13, 4777-4788). 

The agrobacterial strain employed for the transformation 
comprises, in addition to its disarmed Ti plasmid, a binary 

15 plasmid with the T-DNA to be transferred, which, as a rule, 

comprises a gene for the selection of the transformed cells and 
the gene to be transferred. Both genes must be equipped with 
transcriptional and translational initiation and termination 
signals. The binary plasmid can be transferred into the 

20 agrobacterial strain for example by electroporation or other 
transformation methods (Mozo & Hooykaas 1991, Plant Mol. Biol. 
16, 917-918). Coculture of the plant explants with the 
agrobacterial strain is usually performed for two to three days. 

25 A variety of vectors could, or can, be used. In principle, one 
differentiates between those vectors which can be employed for 
the agrobacterium-mediated transformation or agroinf ection, i.e. 
which comprise the recombination constructs, or the expression 
cassette, for the expression of the DSBI enzyme within a T-DNA, 

30 which indeed permits stable integration of the T-DNA into the 
plant genome. Moreover, border-sequence-free vectors may be 
employed, which can be transformed into the plant cells for 
example by particle bombardment, where they can lead both to 
transient and to stable expression. 

35 

The use of T-DNA for the transformation of plant cells has been 
studied and described intensively (EP 120516; Hoekema, In: The 
Binary Plant Vector System, Of f setdrukkeri j Kanters B. v., 
Alblasserdam, Chapter V; Fraley et al.,. Crit. Rev. Plant. Sci., 
40 4:1-46 and An et al., EMB0 J. 4 (1985), 277-287). Various binary 
vectors are known, some of which are commercially available such 
as, for example, pBIN19 (Clontech Laboratories, Inc. USA). 

To transfer the DNA to the plant cell, plant explants are 
45 cocultured with Agrobacterium tumefaciens or Agrobacterium 

rhizogenes. Starting from infected plant material (for example 
leaf, root or stalk sections, but also protoplasts or suspensions 
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of plant cells), intact plants can be regenerated using a 
suitable medium which may contain, for example, antibiotics or 
biocides for selecting transformed cells. The plants obtained can 
then be screened in the presence of the DNA introduced, in this 
5 case the recombination construct or the expression cassette for 
the DSBI enzyme according to the invention. As soon as the DNA 
has integrated into the host genome, the genotype in question is, 
as a rule, stable, and the insertion in question is also found in 
the subsequent generations. As a rule, the expression cassette 

10 integrated contains a selection marker which confers a resistance 
to a biocide (for example a herbicide) or an antibiotic such as 
kanamycin, G 418, bleomycin, hygromycin or phosphinotricin and 
the like to the transformed plant. The selection marker permits 
the selection of transformed cells (McCormick et al., Plant Cell 

15 Reports 5 (1986), 81-84). The plants obtained can be cultured and 
hybridized in the customary fashion. Two or more generations 
should be grown in order to ensure that the genomic integration 
is stable and hereditary. 

20 The abovementioned methods are described, for example, in B. 
Jenes et al., Techniques for Gene Transfer, in: Transgenic 
Plants, Vol. 1, Engineering and Utilization, edited by S.D. Rung 
and R. Wu, Academic Press (1993), 128 - 143 and in Potrykus, 
Annu. Rev. Plant Physiol. Plant Molec. Biol. 42 (1991), 205-225). 

25 The construct to be expressed is preferably cloned into a vector 
which is suitable for the transformation of Agrobacterium 
tumefaciens, for example pBinl9 (Bevan et al., Nucl. Acids Res. 
12 (1984), 8711). 

30 Agrobacterium-mediated transformation is suited best to 

dicotyledonous plant cells, whereas the direct transformation 
techniques are suitable for any cell type. 

Transformed cells, i.e. those which comprise the DNA integrated 
35 into the DNA of the host cell, can be selected from untrans formed 
cells if a selectable marker is part of the DNA introduced. A 
marker can be, for example, any gene which is capable of 
conferring a resistance to antibiotics or herbicides. Transformed 
cells which express such a marker gene are capable of surviving 
40 in the presence of concentrations of a suitable antibiotic or 

herbicide which kill an untransf ormed wild type. Various positive 
and negative selection markers are described hereinabove. 
Examples are the bar gene, which confers resistence to the 
herbicide phosphinothricin (Rathore KS et al., Plant Mdl Biol. 
45 1993 Mar; 21(5) : 871-884) i, the nptll gene, which confers resistance 
to kanamycin, the hpt gene, which confers resistence to 
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hygromycin, or the EPSP gene, which confers resistence to the 
herbicide Glyphosate. 

As soon as a transformed plant cell has been generated, an intact 
5 plant can be obtained using methods known to the skilled worker. 
For example, callus cultures are used as starting material. The 
formation of shoot and root can be induced in this as yet 
undifferentiated cell biomass in the known fashion. The shoots 
obtained can be planted and cultured. 

10 

Also in accordance with the invention are cells, cell cultures, 
parts — such as, for example, in the case of transgenic plant 
organisms, roots, leaves and the like — derived from the 
above-described transgenic organisms, and transgenic propagation 
15 material (such as seeds or fruits). 

Genetically modified plants according to the invention which can 
be consumed by humans or animals can also be used as food or 
feedstuffs, for example directly or following processing known 
20 per se. Here, the deletion of, for example, resistances to 

antibiotics and/or herbicides, as are frequently introduced when 
generating the transgenic plants , makes sense for reasons of 
customer acceptance, but also product safety. 

25 A further subject matter of the invention relates to the use of 
the above-described transgenic organisms according to the 
invention and the cells, cell cultures, parts — such as, for 
example, in the case of transgenic plant organisms, roots, leaves 
and the like — derived from them, and transgenic propagation 

30 material such as seeds or fruits, for the production of food or 
feedstuffs, pharmaceuticals or fine chemicals. Here again, the 
deletion of, for example, resistances to antibiotics and/or 
herbicides is advantageous for reasons of customer acceptance, 
but also product safety. 

35 

Fine chemicals is understood as meaning enzymes, vitamins, amino 
acids, sugars, fatty acids, natural and synthetic flavors, aromas 
and colorants widely usable [sic]. Especially preferred is the 
production of tocopherols and tocotrienols, and of carotenoids. 

40 Culturing the transformed host organisms, and isolation from the 
host organisms or from the culture medium, is performed by 
methods known to the skilled worker. The production of 
pharmaceuticals such as, for example, antibodies or vaccines, is 
described by Hood EE, Jilka JM. (1999) Curr Opin Biotechnol. 

45 10(4) : 382-386; Ma JK and Vine ND (1999) Curr Top Microbiol 
Immunol. 236: 275-92 ) . 
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The recombination system or method according to the invention 
furthermore of fers. various advantageous uses which can not be 
achieved with the deletion methods described in the prior art. 
Various use examples are described hereinbelow by way of example, 
5 but not by limitation: 

1. Simple deletion of a nucleic acid sequence from the 
chromosomal DNA of an organism: 

10 Using any homology sequences A and B, nucleic acid sequences 
located between them can be deleted. The sequence which is 
the result of the recombination of the homology sequences A 
and B remains in the genome. The method is suitable for 
example for removing, from the chromosomal DNA, selection 

15 markers after a transgenic organism, for example a transgenic 
plant, has been generated. The method is shown schematically 
in Figs. 2 and 3, Fig. 2 showing the variant with one 
recognition sequence for the site-directed induction of DNA 
double-strand breaks and Fig. 3 showing the variant with two 

20 recognition sequences for the site-directed induction of DNA 
double-strand breaks. 

2. Complete deletion of recombinantly introduced heterologous 
nucleic acid sequences from the chromosomal DNA of an 

25 organism: 

Using homology sequences A and B, which are homologous to 
certain sequences of the organism, the expression construct 
can be introduced into the organism by homologous 

30 recombination. Using the recombination system or method 

according to the invention, the nucleic acid sequences 
located between the homology sequences would be deleted. The 
induced homologous recombination between homology sequences A 
and B restores the original sequence. All of the construct is 

35 removed from the chromosomal DNA. The method is suitable for 
example for removing selection markers from the chromosomal 
DNA after a transgenic plant has been generated. Furthermore, 
the system or method according to the invention is suitable . 
for expressing certain proteins transiently in order to 

40 achieve an advantageous effect and then to switch them off 

using an induced DSBI enzyme expression or activity by 
irreversibly removing the gene in question from the genome. 
The method is shown schematically in Fig. 4, the variant with 
two recognition sequences for the site-directed induction of 

45 DNA double-strand breaks being shown. The system can also be 

realized using one recognition sequence; however, two 
cleavage sites are advantageous in the case of larger 
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insertions between the homology sequences A and B since this 
allows the deletion efficacy and homologous recombination 
efficacy to be increased further (further recognition 
sequences may be located within the sequence region to be 
5 deleted) . . 

3. Induced gene activation by the site-directed deletion of 
nucleic acid sequences: 

10 Using homology sequences A and B, whose homologous 

recombination restores for example a complete open reading 
frame of a protein, or a functional promotor, the inducible 
expression of target proteins can be realized as a function 
of the presence of the DSBI enzyme. Using the recombination 

15 system or method according to the invention , the nucleic acid 
sequences located between the homology sequences would be 
deleted. The method is shown schematically in Figs. 5 and 6, 
Fig. 6 showing a specific embodiment of the general method 
shown in Fig. 5 in which the recombination construct is 

20 inserted into an endogenous gene at an earlier point in time 

by means of homologous recombination, thus enabling this gene 
to be inducibly activated as a function of the presence of 
the DSBI enzyme. Fig. 7a illustrates the system of gene 
activation with reference to a specific use example where the 

25 p-glucuronidase (GUS) gene is reconstituted using the system 

or method according to the invention, enabling a color 
reaction to take place (see description to Fig. 7a and 
Examples). 

30 4. Readily selectable system for the deletion of a nucleic acid 
sequence from the chromosomal DNA of an organism: 

In a preferred embodiment, the recombination construct 
comprises a positive and a negative selection marker (and, if 

35 appropriate, further nucleic acid sequences to be deleted) in 

such a way that both markers are deleted when the 
double-strand breaks are induced. Such a system is shown in 
Figs. 8 and 9 (A). Moreover, the expression cassette for the 
DSBI enzyme may also be present between the homology 

40 sequences (Fig. 10 (B)), expression preferably being effected 
under the control of an inducible promotor (Pi) (for example: 
Aoyama T and Chua NH (1997) Plant J 11:605-612; Caddick MX et 
al. (1998) Nat. Biotechnol 16:177-180). As already described, 
further nucleic acid sequences may be present (Fig. 9 (C)). 

45 
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Expression of the DSBI enzyme leads in both cases to the 
elimination of the DNA sequences located between the two 
recognition sequences and the recombination of the homologous 
sequences. Since the cells simultaneously lose a negative 
5 selection marker, the cells with a successful deletion can be 
identified by means of selection (Gleave AP et al.(1999) 
Plant Mol Biol. 40:223-235). 

In the case of plant cells, for example, the resulting cells 
10 can be used for regenerating and propagating the 

corresponding intact plants, which now no longer contain any 
marker genes. 

5. Genetic manipulation of the host genome: 

15 

The recombination system or method according to the invention 
can be used for in-situ modifications of the host genome. 
Thus, for example, a homology sequences [sic] may already 
exist endogenous ly in the genome. After insertion of the 
20 second homology sequence, which is linked with a DSBI enzyme 
recognition sequence, any regulatory or coding sequences 
located between the homology sequences A and B are eliminated 
from the genome • 

25 At the same time, it is conceivable that the recombination 
construct encompasses regulatory or coding sequences which, 
are eliminated from the organism once deletion has taken 
place. Thus, it is possible for example to regulate 
transiently an endogenous gene in a site -directed fashion. 

30 

In a further preferred embodiment, the efficacy of the 
recombination system is increased by combination with systems 
which promote homologous recombination. Such systems are 
described and encompass, for example, the expression of proteins 

35 such as RecA or the treatment with PARP inhibitors. It has been 
demonstrated that the intrachromosomal homologous recombination 
in tobacco plants can be increased by using PARP inhibitors 
(Puchta H et al. (1995) Plant J. 7:203-210). Using these 
inhibitors, the homologous recombination rate in the 

40 recombination constructs after induction of the sequence-specific 
DNA double-strand break, and thus the efficacy of the deletion of 
the transgenesequences, can be increased further. Various PARP 
inhibitors may be employed for this purpose. Preferably 
encompassed are inhibitors such as 3-aminobenzamide, 

45 8-hydroxy-2-methylquinazolin-4-one (NU1025), 

1, llb-dihydro-[2H]benzopyrano[4,3,2-de]isoquinolin-3-one (GPI 
6150), 5-aminoisoquinolindne, 
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3 , 4 -dihydro-5 - [ 4 - ( 1 -piper idinyl ) butoxy ] - 

l(2H)-isoquinolinone, or the compounds described in WO 00/26192, 
WO 00/29384, WO 00/32579, WO 00/64878, WO 00/68206, WO 00/67734, 
WO 01/23386 and WO 01/23390. 

5 

In addition, it was possible to increase the frequency of various 
homologous recombination reactions in plants by expressing the E. 
coli RecA gene (Reiss B et al. (1996) Proc Natl Acad Sci USA 
93(7) :3094-3098) . Also, the presence of the protein shifts the 

10 ratio between homologous and illegitime DSB repair in favor of 
homologous repair (Reiss B et al. (2000) Proc Natl Acad Sci USA 
97(7) :3358-3363) . Reference may also be made to the methods 
described in WO 97/08331 for increasing the homologous 
recombination in plants. A further increase in the efficacy of 

15 the recombination system might be achieved by the simultaneous 
expression of the RecA gene or other genes which increase the 
homologous recombination efficacy (Shalev G et al. (1999) Proc 
Natl Acad Sci USA 96(13) :7398-402) . The above-stated systems for 
promoting homologous recombination can also be advantageously 

20 employed in cases where the recombination construct is to be 
introduced in a site-directed fashion into the genome of a 
eukaryotic organism by means of homologous recombination. 

Sequences 

25 

1. SEQ ID NO:l 

Nucleic acid sequence for the I-Scel homing endonuclease. 

2. SEQ ID NO:2 

30 Protein sequence for the I-Scel homing endonuclease. 

3. SEQ ID NO:3 

Nucleic acid sequence for fusion protein of I-Chul homing 
endonuclease and N-terminal nuclear localization sequence. 

35 

4. SEQ ID N0:4 

Protein sequence for fusion protein of I -Chul homing 
endonuclease and N-terminal nuclear localization sequence. 

40 5. SEQ ID N0:5 

Nucleic acid sequence for fusion protein of I-Crel homing 
endonuclease and N-terminal nuclear localization sequence. 

6. SEQ ID NO: 6 
45 Protein sequence for fusion protein of I-Crel homing 

endonuclease and N-terminal nuclear localization sequence. 
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7. SEQ ID NO:7 

Nucleic acid sequence for fusion protein of I -Cpal homing 
endonuclease and N-terminal nuclear localization sequence. 

5 8. SEQ ID N0:8 

Protein sequence for fusion protein of I-Cpal homing 
endonuclease and N-terminal nuclear localization sequence. 

9. SEQ ID NO: 9 

10 Nucleic acid sequence for fusion protein of I-CpaII homing 
endonuclease and N-terminal nuclear localization sequence. 

10. SEQ ID NO: 10 

Protein sequence for fusion protein of I-CpaII homing 
15 endonuclease and N-terminal nuclear localization sequence. 

11. SEQ ID NO: 11: oligonucleotide primer OPN1 

5'-CGG CTC GAG CTA CGG GGA CGA TTT CTT TTT TTC AC- 3 ' 

20 12. SEQ ID NO: 12: oligonucleotide primer OPN2 

5'-CGG CTC- GAG TAC CTA GAA TAC AAA GAA GAG GAA GAA GAA ACC 
TCT AC A GAA GAA GCC ATG GGT CCA AAG AAA AAG AGA AAG GTT ATC 
AT GAA TAC AAA ATA TAA TAA AGA GTT CTT ACT C-3 ' 

25 13. SEQ ID NO: 13: oligonucleotide primer OPN3 

5' -CGG CTC GAG TAC CTA GAA TAC AAA GAA GAG GAA GAA GAA ACC 
TCT ACA GAA GAA GCC ATG GGT CCA AAG AAA AAG AGA AAG GTT ATC 
ATG GAC ATT AAT CCT CAA TGG ATT ACA GG- 3' 

30 14. SEQ ID NO: 14: oligonucleotide primer OPN4 

5' -CGG CTC GAG TTA CTC GCC AGT TTC TTC AAA ACG-3 ' 

15. SEQ ID NO: 15: oligonucleotide primer OPN5 

5' -CGG CTC GAG TAC CTA GAA TAC AAA GAA GAG GAA GAA GAA ACC 
35 TCT ACA GAA GAA GCC ATG GGT CCA AAG AAA AAG AGA AAG GTT ATC 

ATG ACC GAT TCT AAA TCT AGA AAC AAC-3' 

16. SEQ ID NO: 16: oligonucleotide primer OPN6 

5' -CGG CTC GAG CTA AAG GTG GCC TTT ATT GCC ATC AG-3 ' 

40 

17. SEQ ID NO: 17: oligonucleotide primer OPN7 

5 '-CGG CTC GAG TAC CTA GAA TAC AAA GAA GAG GAA GAA GAA ACC 
TCT ACA GAA GAA GCC ATG GGT CCA AAG AAA AAG AGA AAG GTT ATC 
ATG TCA TTA ACA CAA CAA CAA AAA GAC-3 ' 

45 
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18. SEQ ID NO: 18: oligonucleotide primer OPN8 

5' -CGG CTC GAG CTA AAG GTG GCC TTT ATT GCC ATC AG-3 ' 

19. SEQ ID NO: 19: oligonucleotide primer OPN9 

5 5'-CGG CTC TAG AGC GGC CGC CTA GGG ATA ACA GGG TAA TAG AAT 

CCC ACA AAA ATC TGA GCT TAA CAG 3 ' 

20. SEQ ID NO: 20: oligonucleotide primer OPN10 

5' -CGG CTC TAG ACT ATT ACC CTG TTA TCC CTA GGC CCG ATC TAG 
10 TAA CAT AGA TGA CAC CGC GCG CG 3 ' 

21. SEQ ID NO: 21: oligonucleotide primer OPN11 

5'- CGG AAG CTT CGT CAC CAA TCC CAA TTC GAT CTA C - 3' 

15 22. SEQ ID NO: 22: oligonucleotide primer OPN12 

5'- CGG AAG CTT CCA CTT GCA AAG TCC CGC TAG TGC C - 3' 

23. SEQ ID NO: 23: oligonucleotide primer OPN13 

5'- CGG AAG CTT CGT CAC CAA TCC CAA TTC GAT CTA' C - 3' 

20 

24. SEQ ID NO: ■ 24: oligonucleotide primer OPN14 

5 ' - CGG AAG CTT CCA CTT GCA AAG TCC CGC TAG TGC C - 3 ' 

25. SEQ ID NO: 25: oligonucleotide primer OPN15 

25 5'- CTA GTA CAA AAC GTC GTG AGA CAT TTT AAT CTG AAG GTT TGG 

CAC CTC GAT GTC GGC TCA TC-3 ' 

26. SEQ ID NO: 26: oligonucleotide primer OPN16 

5' -CTA GGA TGA GCC GTC ATC GAG GTG CCA AAC CTT CAG ATT AAA 
30 ATG TCT CAC GAC GTT TTG TA-3 ' 

27. SEQ ID NO: 27: oligonucleotide primer OPN17 

5 ' -CTA GTC CGA AAA CGC CGT GAG ACA TAT TGG TTA CGA TCC TAA 
GGT AGC GAA ATT CAC CCG GTA ACT CTG TGC CAG- 3 ' 

35 

28. SEQ ID NO: 28: oligonucleotide primer OPN18 

5' -CTA GCT GGC ACA GAG TTA CCG GGT GAA TTT CGC TAC CTT AGG 
ATC GTA ACC AAT ATG TCT CAC GGC GTT TTC GGA- 3 ' 

40 29. SEQ ID NO: 29: nuclear localization sequence NLS1 
N-Pro-lys-Thr-Lys-Arg-Lys-Val-C 

30. SEQ ID NO: 30: nuclear localization sequence NLS2 
N-Pro-Lys-Lys-Lys-Arg-Lys-Val-C (SEQ ID NO: 30) 

45 
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Figures 

The following abbreviations apply to the figures in general: 

5 HI: Homology sequence A 

H2: Homology sequence B 

Hl/2: Sequence as the result of homologous recombination of HI 
and H2 

SI: First recognition sequence for the site-directed 

10 induction of DNA double-strand breaks 

S2: Second recognition sequence for the site-directed 

induction of DNA double-strand breaks 
E: DSBI enzyme 

P: Promotor or other genetic control element 

15 N: Further nucleic acid sequence 

NS: Negative selection marker 

PS: Positive selection marker 

Tl: Front part, for example of a gene or open reading frame 

T2: Back part, for example of a gene or open reading frame 

20 STOP: Interruption of a gene or open reading frame by, for 
example, stop codons or reading-frame shifts. 

Fig, 1: Diagram of the principle of the invention 

Sequences in the genome can be eliminated efficiently 

25 when they are flanked by the homology sequences HI and H2 

and when a cleavage site (SI) for a DSBI enzyme is 
located between the homology sequences • Owing to the 
action of the DSBI enzyme (E) on this recombination 
cassette (H1-S1-H2), double-strand breaks are formed at 

30 the cleavage site SI and the sequences located between HI 

and H2 are eliminated. 

Fig. 2: Preferred embodiment . 

Sequences — in the present case for example an expression 

35 cassette consisting of a promotor (P) and a further 

nucleic acid sequence (N) to be expressed (for example a 
selection marker) - can be eliminated efficiently from 
the chromosomal DNA when the are flanked by the homology 
sequences Hi and H2 and when a cleavage site (SI) for a 

40 DSBI enzyme is located between the homology sequences. 

Owing to the action of the DSBI enzyme (E) on this 
recombination cassette (H1-S1-P-N-H2 ) , double-strand 
breaks are formed at the cleavage site SI and the 
sequences located between Hi and H2 are eliminated. The 

45 cleavage site SI may also be located behind or within the 

expression cassette. 
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Fig. 3: Preferred embodiment 

Sequences — in the present case for example an expression 
cassette consisting of a promotor (P) and a further 
nucleic acid sequence (N) to be expressed (for example a 
5 selection marker) — can be eliminated particularly 

efficiently from the chromosomal DNA when they are 
flanked by the homology sequences Hi and H2 and when in 
each case one cleavage site (SI and S2) for a DSBI enzyme 
is located before and after the nucleic acid sequence to 
10 be deleted. Owing to the action of the DSBI enzyme (E) on 

this recombination cassette (H1-S1-P-N-S2-H2) , 
double-strand breaks are formed at the cleavage sites SI 
and S2 and the sequences located between Hi and H2 are 
eliminated. 

15 

Fig. 4: Preferred embodiment 

Sequences — in the present case for example an expression 
cassette consisting of a promotor (P) and a further 
nucleic acid sequence (N) to be expressed (for example a 

20 selection marker) can be eliminated virtually without 

trace from the chromosomal DNA when the recombination 
construct comprising them has previously been inserted 
into the host genome, for example by homologous 
recombination. In doing so, the gene consisting of the 

25 sequence segments Tl, Hl/2 and T2 is interrupted. The 

recombination construct is flanked by two parts of the 
interrupted gene (Tl-Hl or H2-T2), the middle part (HI or 
H2) having been duplicated in order to enable homologous 
recombination to take place. The action of the DSBI 

30 enzyme (E) on the cleavage sites (SI and S2) induces 

double-strand breaks and induces the homologous 
recombination between the homology sequences HI and H2, 
whereby firstly the sequences located between HI and H2 
are deleted and secondly the original gene is restored. 

35 

Fig. 5: Preferred embodiment 

Nucleic acid sequences (in the present case a gene with 
the sequence T1-H1/2-T2 under the control of a promotor - 
P) can be expressed inducibly by reconstituting the 

40 intact gene only by applying the recombination system. 

The gene, consisting of the sequence segments Tl, Hl/2 
and T2 is inactivated, for example by the insertion of 
stop codons or other interruptions of the reading frame 
within the recombination construct. The recombination 

45 construct is flanked by two parts of the interrupted 

gene (Tl-Hl or H2-T2), the middle part (HI or H2) having 
been duplicated in order to enable homologous 
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recombination to take place. The action of the DSBI 
enzyme (E) on the cleavage sites (SI and S2) induces 
double-strand breaks and induces the homologous 
recombination between the homology sequences HI and H2 , 
5 whereby firstly the sequences located between HI and H2 

are deleted and secondly the intact gene is restored. 

Fig. 6: Preferred embodiment 

The figure shows a method which is identical to that 
10 described in Fig. 5, only that in the present case an 

endogenous gene is to be activated in a site-directed 
manner by introducing the recombination construct for 
example by means of homologous recombination. 

15 Fig. 7a: Exemplary embodiment 

The figure illustrates a specific embodiment of the 
method described in Fig. 6. A recombination construct is 
introduced via agrobacterium-mediated trans feet ion. 
Flanked by the right and left border sequence ( RB and LB, 

20 respectively), the construct comprises the interrupted 

reading frame of the GUS gene (^-glucuronidase) under the 
control of the 35S promotor (P) and the nopaline sythase 
[sic] (nos) terminator. The middle region of the GUS gene 
(U) was duplicated and constitutes the homology sequences 

25 A and B. Located between these sequences is the codA gene 

as negative selection marker under the control of the 
Cauliflower Mosaic Virus (CaMV) 35S promotor and the 
nopaline synthase (nos) terminator, flanked by two 
recognition sequences of the DSBI enzyme (SI and S2). The 

30 recombination construct furthermore additionally 

comprises the BAR gene under the control of the 35S 
promotor (P) and the 35S terminator, as positive 
selection marker. 

35 Fig. 7a illustrates the appearance of double-strand breaks 

andi the homologous recombination between the homologous U 
sequences , brought about by the action of the DSBI 
enzyme, whereby firstly the sequences located between the 
homologous U sequences are deleted and secondly the GUS 

40 gene is restored. The length of the Acc65I fragment is 

thus reduced from 7.3 kb to 3.7 kb. 

Fig. 7b: Shows the same system as described under Fig. 7a. Fig. 7a 

illustrates the appearance of double-strand breaks as the 
45 result of the action of the DSBI enzyme. In contrast to 

Fig. 7a, no homologous recombination takes place in the 
present case, but illegitimate recombination by 
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non-homologous end- joining. While the region between SI 
and S2 is deleted owing to the two cleavage sites, the 
GUS gene is not restored. The length of the Acc65I 
fragment is thus reduced from 7.3 kb to 4.4 kb. 

5 

Fig. 7c: The figure is another representation of the two end 
products of the processes described under Fig. 7a and 
Fig. 7b. 

A: Result of the homologous recombination; Acc65I 
10 fragment has a length of 3.7 kb; the size of the 

fragment amplified with the aid of the primers 0PN13 
and 0PN14 (shown by the arrows) is 0.7 kb. 
B: Result of the illegitime recombination 

(non-homologous end- joining ) ; Acc65I fragment has a 
15 length of 4.4 kb; the size of the fragment amplified 

with the aid of the primers OPN 13 and 0PN14 (shown by 
the arrows) is 1.4 kb. 

Fig. 8: Preferred embodiment 

20 The recombination cassettes advantageously encompass both 

a positive and a negative selection marker (PS and NS, 
respectively), in each case under the control of a 
promotor. The positive selection marker is useful for 
facilitating and detecting the introduction of the 

25 construct into the genome. The negative selection marker 

is useful for detecting the deletion of the construct 
from the genome. Both markers are eliminated efficiently 
from the chromosomal DNA when they are flanked by the 
homology sequences Hi and H2 and when in each case one 

30 cleavage site (SI and S2, respectively) for a DSBI enzyme 

is located before and/or after the nucleic acid sequence 
to be deleted. Owing to the effect of the DSBI enzyme (E) 
on this recombination cassette, double-strand breaks 
appear at the cleavage sites SI and/or S2 and the 

35 sequences located between HI and H2 are then eliminated. 

The effect of one of the abovementioned DSBI enzymes 
brings about site-directed double-strand breaks and 
induces the homologous recombination between the 
40 homologous U sequences , whereby firstly the sequences 

located between the homologous U sequences are deleted 
and secondly the GUS gene is restored. 



45 
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Fig. 9: Readily selectable systems for deleting a nucleic acid 
sequence from the chromosomal DNA of an organism. The 
constructs comprise a positive selection marker (PS) and 
negative selection marker (NS) , in each case under the 
5 control of a promotor ( P ) . 

(B) additionally comprises an expression cassette for the 
DSBI enzyme, expression preferably taking place under the 
control of an inducible promotor (Pi). 

(C) Further nucleic acid sequences may be present. 

10 Expression of the DSBI enzyme leads in all cases to the 

elimination of the DNA sequences located between the two 
recognition sequences and to the recombination of the 
homologous sequences. Since the cells simultaneously lose 
a negative selection marker f the cells where successful 

15 deletion has taken place can be identified by means of 

selection (Gleave AP et al.(1999) Plant Mol Biol. 
40:223-235). 

Fig. 10: The figure illustrates the two constructs ("SI construct 

20 (A) and SD construct (B)) which was [sic] used for 

proving that homologous recombination by double-strand 
breaks can be induced with different restriction enzymes. 
The constructs are introduced via agrobacterium-mediated 
transfection. The constructs, which are flanked by the 

25 right and left border sequence (RB and LB, respectively) 

contain the interrupted reading frame of the GUS gene 
(p-glucuronidase) under the control of the 35S promotor 
(P) and the nopal ine synthase (nos) terminator. The 
middle region of the GUS gene (U) was duplicated and 

30 constitutes the homology sequences A and B. Located 

between these sequences are, in the case of the SI 
construct (A) , the recognition sequences. of the DSBI 
enzymes I-Scel, I-Cpal, I-CpaII and I-Crel, and, in the 
case of the SD construct (B), the recognition sequence of 

35 the I-Chul enzyme. The recombination constructs 

furthermore additionally contain the BAR gene under the 
control of a promotor (P) as positive selection marker. 

Fig. 11: Representative histochemical analysis of tobacco calli 
40 obtained after the induction of double-strand breaks. A 

blue coloration (here shown as dark coloration) indicates 
the expression of the JJ-glucuronidase gene, and thus the 
elimination of the selection marker by homologous . 
recombination. Blue (dark colorations) can be seen in the 
45 case of the calli in the wells A2, AS , A6, B2, CI, C6 and 

D2. 
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Fig. 12: PCR analysis for detecting homologous recombination. PCR 
on DNA from tobacco calli using the primers OPN13 and 
OPN14. 

5 The PCR product (size 0.7 kb) which indicates homologous 

recombination can be seen in lanes 1 , 2 and 3 . The 
corresponding calli were blue following histochemical 
staining, and the corresponding PCR bands were sequenced 
in-order to demonstrate that the open reading frame (ORF) 
10 of B-glucuronidase was indeed restored by homologous 

recombination. 

Lanes 4 and 5: PCR products (1.4 kb) of calli which did 
not turn blue upon staining, where the transgene was 
15 eliminated by non-homologous end-joning. 

Fig. 13: Southern blots which indicate the complete elimination of 
the transgene sequence in question. 
The lanes of blots A to D comprise in each "case: 

20 



25 



30 



35 



Lane 


Line 






Description 


1 


GU.C.USB 


1 




Original line 


2 


GU.C.USB 


1- 


-61 


Non-homologous end- joining 


3 


GU.C.USB 


!• 


-83 


Homologous recombination 


4 


GU.C.USB 


3 




Original line 


5 


GU.C.USB 


3< 


-1 


Non-homologous end- joining 


6 


GU.C.USB 


3- 


-3 


Homologous recombination 


7 


GU.C.USB 


7 




Original line 


8 


GU.C.USB 


7- 


-14 


Non-homologous end-joining 


9 


GU.C.USB 


7- 


-34 


Homologous recombination 



A: Hindlll-digested DNA hybridized with a 
B-glucuronidase-specif ic sample [sic]. 

B: Hindlll-digested DNA hybridized with a codA-specif ic 
sample [sic] . 



C: Acc65l-digested DNA hybridized with a 
40 B-glucuronidase-specif ic sample [sic]. 

D: Acc65l-digested DNA hybridized with a codA-specif ic 
sample [sic]. 

45 The analysis demonstrates that f following the 

induction of DNA double-strand breaks by means of 
expression of the restriction enzyme, not only with 
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[sic] homologous recombination (lanes 3, 6 and 9), 
but also with [sic] illegitimate recombination (lanes 
2,5 and 8) may occur/ the transgene sequence (codA) 
located between the restriction cleavage sites always 
5 having been eliminated from the plant genome. 

Examples 

General methods: 

10 

The chemical synthesis of oligonucleotides can be effected for 
example in the known manner using the phosphoamidite method 
(Voet, Voet, 2nd edition, Wiley Press New York, pages 896-897). 
The cloning steps carried out for the purposes of the present 

15 invention, such as, for example, restriction cleavages, agorose 
[sic] gel electrophoresis, purification of DNA fragments, the 
transfer of nucleic acids to nitrocellulose and nylon membranes, 
the linkage of DNA fragments, the transformation of E. coli 
cells, bacterial cultures, the propagation of phages and the 

20 sequence analysis of recombinant DNA are carried out as described 
by Sambrook et-al. (1989) Cold Spring Harbor Laboratory Press; 
ISBN 0-87969-309-6. Recombinant DNA molecules were sequenced 
using an ALF Express laser fluorescence DNA sequencer (Pharmacia, 
Upsala [sic], Sweden) following the method of Sanger (Sanger et 

25 al., Proc. Natl. Acad. Sci. USA 74 (1977), 5463-5467). 

Example 1: Cloning of the homing endonucleases 

The open reading frames (ORFs) of the homing endonucleases I-Crel 
30 (Wang J et al. (1997) Nucleic Acids Res 25: 3767-3776), I-Chul 
(Cote V et al.(1993) Gene 129:69-76), I-Cpal (Tunnel M et al. 
(1995a) Nucleic Acids Res 23:2519-2525) and I-CpaII (Turmel M 
et al. (1995b) Mol. Biol. Evol. 12, 533-545) were cloned from the 
corresponding Chlamydomonas strains. 

35 

To ensure optimal translation of the gene, the ORFs of the 
endonucleases were linked to the leader sequence of a plant virus 
(CaMV gene V, as has proven useful in the case of I-Scel; Puchta 
H (1993) Nucl Acids Res 21:5034-5040). Also, a nuclear 
40 localization sequence (NLS2; SEQ ID NO: 30) was placed in front 
of the ORFs in order to efficiently transport the protein to the 
intended site of action. The two elements (leader sequence and 
nuclear localization sequence) were introduced via PCR by means 
of the oligonucleotide primers used. 
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To isolate the open reading frames (ORFs) of the endonucleases 
from Chlamydomonas, the algal cultures Chlamydomonas 
reinhardtii/ Smith (strain no, ll-32b), Chlamydomonas 
applanata/Lucksch (strain no.: 11-9.) and Chlamydomonas 
5 segris/King (strain no.: 9.83) were obtained from the algal 
culture collection at Gottingen (University of Gottingen, 
experimental phycology and collection of algal cultures, 
Albrecht-von-Haller Institute for Plant Sciences, Untere Karspule 
2, D-37073 Gottingen). The cultures were grown with the aid of a 
10 shake culture in MS medium, and DNA was obtained using the DNeasy 
Plant Maxi Kit (Qiagen, Hilden). 

The I-Crel ORF (GenBank Acc. No.: X01977) was amplified from a 
sample of the algal culture ll-32b Chlamydomonas 
15 reinhardtii/Smith with the aid of the oligonucleotides OPN1 and 
OPN2 (SEQ ID NO: 11 and 12). 

OPN1 (SEQ ID NO: 11) : 

5' -CGG CTC GAG CTA CGG GGA CGA TTT CTT TTT TTC AC- 3' 

20 

OPN2 (SEQ ID NO: 12): 

5'- CGG CTC GAG TAC CTA GAA TAC AAA GAA GAG GAA GAA GAA ACC TCT 
ACA GAA GAA GCC ATG GGT CCA AAG AAA AAG AGA AAG GTT ATC AT GAA 
TAC AAA ATA TAA TAA AGA GTT CTT ACT C 3' 

25 

2 fil (corresponding to approximately 100 ng DNA) of the DNA 
preparation were employed in the PCR reaction. The following were 
combined in a total volume of 50 |xl in accordance with the 
manufacturer's instructions (Life Technologies): 

30 

5 \il 10X PCR buffer [200 mM Tris-HCl (pH 8.4), 500 mM KC1] 
1.5 nl 50 mM MgCl 2 

1 \il 10 mM dNTP mix (10 mM of each dATP, dCTP, dGTP and dTTP) 

1 jxl primer OPN1 (10 nM) 
35 1 jxl primer OPN2 (10 pM) 

0.4 \xl Taq DNA polymerase (5 U/jil) 

2 (xl DNA preparation 

38 • 1 }iX autoclaved distilled water 

40 The reaction mixture is covered with approx. 50 \il of silicone oil 
and subjected to the following temperature program (Thermocycler : 
MWG Biotech Primus HT; MWG Biotech, Germany) : 

1 cycle of 180 sec at 95°C 
45 30 cycles of 60 sec at 92°C, 60 sec at 54°C and 3 min at 

72°C 



0817/00023 



CA 02451492 2003-12-18 



56 

1 cycle of 5 min at 72°C. 

The PCR fragment was purified via agarose gel electrophoresis 
using the QIAquick® Gel Extraction Kit (Qiagen, Hilden, Germany) 
5 and cloned into the pGEM-T Easy vector (Promega, Madison, USA). 
Next, a sequence analysis is carried out using the ALF-Express 
DNA sequencer (Pharmacia, Upsala, Sweden). The sequence is shown 
in SEQ ID NO: 5. 

10 Cloning of the ORF of I-Cpal from the algal culture 9.83 

Chlamydomonas segris/King (Genbank Acc. No.: L36830) was carried 
out analogously to the description given for I-Crel. The 
oligonucleotides 0PN3 and 0PN4 were used for the PCR. The 
sequence is shown in SEQ ID NO: 7. 

15 

0PN3 (SEQ ID NO: 13): 

5'-CGG CTC GAG TAC CTA GAA TAC AAA GAA GAG GAA GAA GAA ACC TCT 
ACA GAA GAA GCC ATG GGT CCA AAG AAA AAG AGA AAG GTT ATC ATG GAC 
ATT AAT CCT CAA TGG ATT ACA GG- 3' 

20 

OPN4 (SEQ ID NO: 14): 

5'-CGG CTC GAG TTA CTC GCC AGT TTC TTC AAA ACG-3 ' 

Cloning the ORF of I-Cpall was also carried out analogously as 
25 described for I-Crel (Genbank Acc. No: L39865). A sample of the 
algal culture 9.83 Chlamydomonas segris/King was used for this 
purpose. The oligonucleotides OPN5 and OPN6 were used for the 
PCR. The sequence is shown in SEQ ID NO: 9. 

30 OPN5 (SEQ ID NO: 15) : 

5'-CGG CTC GAG TAC CTA GAA TAC AAA GAA GAG GAA GAA GAA ACC TCT 
ACA GAA GAA GCC ATG GGT CCA AAG AAA AAG AGA AAG GTT ATC ATG ACC 
GAT TCT AAA TCT AGA AAC AAC-3 ' 

35 0PN6 (SEQ ID NO: 16): 

5'-CGG CTC GAG CTA AAG GTG GCC TTT ATT GCC ATC AG-3' 

Cloning of the ORF of I-Chul from the algal culture 11-9 
Chlamydomonas applanata/Lucksch (Genbank Acc. No.: L06107) was 
40 carried out analogously to the description given for I-Crel. The 
oligonucleotides 0PN7 and 0PN8 were used for the PCR. The 
sequence is shown in SEQ ID NO: 3. 



45 
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OPN7 (SEQ ID NO: 17): 

5'-CGG CTC GAG TAC CTA GAA TAC AAA GAA GAG GAA GAA GAA ACC TCT ACA 
GAA GAA GCC ATG GGT CCA AAG AAA AAG AGA AAG GTT ATC ATG TCA TTA 
ACA CAA CAA CAA AAA GAC-3 ' 

5 

OPN8 (SEQ ID NO: 18): 

5 ' -CGG CTC GAG CTA AAG GTG GCC TTT ATT GCC ATC AG-3 ' ) 

The ORF of the individual homing endonucleases (with the nuclear 
10 localization signal) was in each case excised from the pGEM-T 
Easy vector by means of restriction digestion with Sail, purified 
by gel electrophoresis and in each case cloned into the Sail 
restriction cleavage site of the binary vector pBinAR (Hofgen and 
Willmitzer (1990) Plant Science 66:221-230) . Expression of the 
15 individual enzymes takes place under the control of the 35S 
promotor and the octopine synthase terminator. 

The binary I-Scel expression vector pClScel (Puchta H et al. 
(1996) Proc. Natl. Acad. Sci. USA 93:5055-5060) contains a 
20 synthetic I-Scel ORF under the control of the CaMV 35S promotor 
(Puchta H et al. (1993) Nucl Acids Res 21: 5034-5040) between the 
T-DNA borders . 

All of the five plasmids were multiplied in E. coli, purified by 
25 means of the QIAfilter Plasmid Midi kit (Qiagen, Hilden) and 
transferred into the agrobacterial strain C58 by means of 
electroporation . 

Example 2: Generation of the construct pGU.I.USB 

30 

The plasmid pGU.US (Tinland B et al. (1994) Proc. Natl. Acad. 
Sci. USA 91:8000-8004) was used for constructing the 
recombination substrates, within the T-DNA region, the plasmid 
contain two overlapping halves of the B-glucuronidase (GUS) gene 
35 with a 557 bp overlap. A hygromycin gene is integrated in a 
unique Xbal cleavage site between the GUS sequences. 

In a first step, the BAR gene together with promotor and 
terminator sequences was excised from the vector pRC (Puchta H et 

40 al. (1996) Proc Natl Acad Sci USA 93:5055-5060) in the form of an 
isolated Hindlll fragment, separated from the vector sequence via 
agarose gel electrophoresis, excised from the gel, isolated with 
the aid of the QIAquick® Gel Extraction Kit (Qiagen, Hilden, 
Germany) and thereafter inserted into the unique Hindlll cleavage 

45 site of pGU.US. To this end, the vector pGU.US was previously cut 
with Hindlll and dephosphorylated with alkaline phosphatase (calf 
intestinal alkaline phosphatase (CIP), New England Biolabs, 
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Frankfurt, Germany) in order to prevent recircularization. The 
resulting vector is termed pGU. US-BAR. 

In the vector pNE3 (Stougaard J (1993) Plant J 3:755-761), the 
5 Xbal cleavage site was first removed by a Klenow filling-in 
reaction. The open reading frame (ORF) of the negative selection 
marker gene cytosine deaminase (codA) under the control of the 
Cauliflower Mosaic Virus (CaMV) 35S promotor and of the nopaline 
sythase [sic] (nos) terminator was amplified from the resulting 

10 vector pNE3-XBA by means of PCR using the oligonucleotide primers 
ONP9 (SEQ ID NO: 16) and ONP10 (SEQ ID NO: 17). Owing to the 
oligonucleotide primers OPN9 and OPN10 used, in each case one 
I-Scel cleavage site (emphasized in bold in the sequences stated 
hereinbelow) and an Notl or Xbal cleavage site were added to the 

15 two ends of the amplificate. 

OPN9 (SEQ ID NO: 19): 

5 ' -CGG CTC TAG AGC GGC CGC CTA GGG ATA ACA GGG TAA TAG AAT CCC 
ACA AAA ATC TGA GCT TAA CAG 3' 

20 

OPN10 (SEQ ID NO: 20): 

5' -CGG CTC TAG ACT ATT ACC CTG TTA TCC CTA GGC CCG ATC TAG TAA 
CAT AGA TGA CAC CGC GCG CG 3 ' 

25 2 ill (corresponding to approximately 100 ng) of a plasmid 

preparation of pNE3-XBA were employed for the PCR reaction* The 
following were combined in a total volume of 50 fxl in accordance 
with the manufacturer's instructions (Life Technologies): 

30 5 jil 1 OX PCR buffer [200 mM Tris-HCl (pH 8.4), 500 mM KC1] 

1.5 >1 50 mM MgCl 2 

1 nl 10 mM dNTP mix (10 mM of each dATP, dCTP, dGTP and dTTP) 
1 |il primer OPN1 (10 [AM) 
1 \il primer OPN2 (10 fiM) 
35 0.4 |il Taq DNA polymerase (5 U/(il) 

.2 \il plasmid preparation of pNE3-XBA 
38.1 |il autoclaved distilled water 

The reaction mixture is covered with approx. 50 ]il of silicone oil 
40 and subjected to the following temperature program (Thermocycler : 
MWG Biotech Primus HT; MWG Biotech, Germany) : 

1 cycle of 180 sec at 95°C 
30 cycles of. 60 sec at 92°C, 60 sec at 54°C and 3 min at 
45 72°C 

1 cycle of 5 min at 72°C. 
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The PCR product was digested with Xbal and Notl. The vector 
pGU-US-BAR was likewise digested with Xbal and Notl (which 
resulted in the deletion of the hygomycin marker gene), and the 
vector fragment was purified by agarose gel electrophoresis using 
5 the QIAquick® Gel Extraction Kit (Qiagen, Hilden, Germany). 

Ligation of the digested PCR fragment and vector gave rise to the 
binary vector pGU.C.USB (see Fig. 7a). The vector contains a 
marker gene (cytosine deminase (codA)) on a T-DNA between two 
I-Scel cleavage sites. The I-Scel cleavage sites are outwardly 

10 flanked by homologous sequence regions 557 bp in size of the 
^-glucuronidase gene (GUS). The GUS gene acts as homologous 
restoration marker (Swoboda P et al. (1994) EMBO J 13:481- 489). 
If the gene is restored by homologous recombination , the 
expression can be detected histochemically. Elimination of the 

15 marker gene gives rise to 5-FC ( fluorocytosine) -resistant tobacco 
cells/ which can then regenerate it to give calli (Salomon S and 
Puchta H (1998) EMBO J 17:6086-6095) . 

Example 3: Plant transformation with pGU.I.USB 

20 

Nicotiana tabacum L. cv. Petite Havana Line SRI seedlings were 
transformed with the agrobacterial strain C58, which contained 
the binary vector pGU.C.USB. 

25 To this end, seeds were placed on moistened filter paper under 
sterile conditions and the seedlings were harvested after 2 
weeks, all as described by Puchta H. (1999) Methods Mol Biol 113: 
447-451 (25°C, 16 hours light/8 hours dark rhythm). 

30 For the inoculation, the agrobacteral strain containing the 
binary transformation plasmid was first grown overnight in a 
shake culture at 28°C in YEB medium. Then, the agrobacterial 
suspension was centrifuged for 10 minutes at 15.000 g and the 
cells were taken up in in 10 mM MgS04 so that the final optical 

35 density of the suspension had a value of approximately 0.5. In a 
reaction vessel, the seedlings were then placed into the 
bacterial solution under sterile conditions and a vacuum of 
0.15 at was applied in a sterile desiccator. After 10 minutes, 
the seedlings were then placed on MS plates supplemented with BAP 

40 (6-benzylaminopurine 5 jig/ml) and NAA ( 1-naphthaleneaceteic acid 
0.5 ng/ml) and left for 3 days in a growth cabinet (25°C, 16 hours 
light/ 8 hours dark rhythm) . The seedlings were then placed on MS 
medium supplemented additionally with phosphinothricin 
(100 jxg/ml), vancomycin (1 jxg/ml) and cefotaxim (0.5 n.g/ml) in 

45 addition to NAA and BAP. Every 10 days, the seedlings were 
transferred to freshly made plates. Eventually, the resulting 
calli formed shoots. As soon as the shoots had attained a certain 
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size (1 to 2 cm), they were excised from the callus material and 
planted in magenta boxes comprising MS medium supplemented with 
phosphinothricin, vancomycin and cefotaxin (concentrations as 
above). After a short time, the shoots developed roots; they were 
5 transferred into soil after 2 to 4 weeks. The plants were made to 
flower in the greenhouse and were then selfed, and the seeds 
formed were left to ripen in the capsules. The seeds were then 
placed on MS medium supplemented with 300 ng of phosphinothricin 
(for the positive selection) or 500 \ig of 5-FC (f luorocytosin; for 

10 the negative selection) per ml in order to carry out the 

segregation analyses. By determining the ratio of resistant to 
sensitive seedlings (3:1 in the case of positive selection and 
1:3 in the case of negative selection) , it was possible to 
demonstrate that the recombination constructs were inserted at a 

15 locus in the three selected lines. 

Example 5: Induction of gene deletion by introducing the DSBI 
enzyme I-Scel 

20 Ih the experiments, Fl seedlings of the transgenic lines GU.C.USB 
1, 3 and 7, each of which comprises a copy of the T-DNA GU.C.USB 
shown in Fig. 2, were inoculated with an agrobacterial strain 
which transiently expressed I-Scel and which comprised the 
plasmid pCIScel (Puchta H et al. (1996) Proc Natl Acad Sci USA 

25 93, 5055-5060) in the abovedescribed manner (see also Puchta, 
1999b). After 3 days, the seedlings were plated on MS medium 
supplemented with BAP and NAA (concentrations as above) medium to 
the same medium additionally in the presence of 100 \iq of 5-FC and 
100 ng of phosphinothricin per ml incubated in order to detect 

30 plant cells in which the marker gene to be eliminated (in this 
case the codA gene) was deleted. After 6 weeks, the calli growing 
on the medium were divided into two, and one part was used for 
the regeneration of shoot axes while the other was used for 
isolating DNA and for the B-glucuronidase assay. The resulting 

35 5-FC-resistant transgenic calli were analyzed for homologous 
recombination events by means of histochemical staining. A blue 
staining indicated restoration of the callus (see Fig. 11). 

The histochemical staining of the calli was carried out as 
40 described by Swoboda et al., 1994. To this end, the calli were 
introduced into staining solution (0.3 mg X-Gluc [Duchefa, 
Harlem, Nl] per ml of 100 mM sodium phosphate buffer pH 7.0; 0.1% 
Triton; 0.05% NaN3 ) • A vacuum was applied for 15 minutes to the 
desiccator , and the calli were subsequently incubated in the 
45 solution for 48 hours at 37°C. After the staining solution was 
poured off, the remaining chlorophyll was removed from the plant 
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material by repeated shaking in 80% ethanol. The blue staining 
obtained indicated the fl-glucuronidase activity. 

In approximately one quarter of the cases, the marker gene was 
5 eliminated successfully by homologous recombination (Fig. 11, 
Table 2). 

Table 2. Number of 5-FC -resistant tobacco calli following 
transient DSB induction 

10 



15 



Transgenic 
line 


Seedlings 


resis- 
tant 
calli 


GUS-positive 


GUS-positive 
(% of 
resistant 
calli) 


GU.C.USB 1 


290 


56 


22 


39 


GO. C. USB 3 


490 


90 


24 


27 


GU.C.USB 7 


370 


59 


11 


19 



Molecular analyses confirm the facts: since the line GU.C.USB 1 
contained a single copy of the transgene, the calli were analyzed 
directly for recombination results by means of PCR. 

A random fraction of calli was then analyzed at the molecular 
level by means of PCR. Molecular analysis with the primer pairs 

OPN11 (SEQ ID NO: 21) 

5'- CGG AAG CTT CGT CAC CAA TCC CAA TTC GAT CTA C - 3' and 
OPN12 (SEQ ID NO: 22) 

5'- CGG AAG CTT CCA CTT GCA AAG TCC CGC TAG TGC C - 3' 

allowed the isolation of the newly-formed linkage sites from the 
tobacco genome (Fig. 12; Table 3). 

Table 3. Molecular analysis of recombination events by means of 
PCR 



40 



Transgenic 
line 


Calli 


PCR fragment (s) ! 
0.7 kb 1.4 kb none/other 


GU.C.USB 1 


30 


10 12 7 



45 
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Three 0.7 kb PCR fragments were selected and sequenced. In all 
three cases, sequencing confirmed the functional sequence of the 
fl-glucuronidase gene, i.e. the restoration of the gene did indeed 
take place accurately by homologous recombination. 

5 

When five 1-4 kb PCR bands were sequenced, it was found that 
these bands were formed after excision of the codA gene by 
reparation of the two I-Scel cleavage sites (by non-homologous 
end- joining, NHEJ) without homologous recombination taking place. 
10 In most cases, minor deletions at the I-Scel cleavage site 
resulted. 

Southern blots demonstrated that, as expected, complete 
elimination of the sequence located between the I-Scel cleavage 
15 sites took place in the recombinants with the 0.7 and 1.4 kb 
bands, respectively. No codA-specif ic DNA whatsoever was 
detectable any longer in the genome of the regenerated plants 
(Fig. 13 B and D, lanes 2 and 3). 

20 The DNA was isolated with the aid of the DNeasy Plant Mini Kit 
(Quiagen, Hilden) . To detect the recombination products, genomic 
DNA was analyzed by means of PCR using the oligonucleotides OPN13 
ahdOPN14.. 

25 OPN13 (SEQ ID NO: 23): 

5 ' - CGG AAG CTT CGT CAC CAA TCC CAA TTC GAT CTA C - 3 ' 

OPN14 (SEQ ID NO: 24): 

5'- CGG AAG CTT CCA CTT GCA AAG TCC CGC TAG TGC C - 3' 

30 

5 *il 10X PCR buffer [200 mM Tris-HCl (pH 8.4), 500 mM KC1] 
1.5 fil 50 mM MgCl 2 . 

1 (il 10 mM dNTP mix (10 mM of each dATP, dCTP, dGTP and dTTP) 

1 [il primer OPN1 (10 jiM) 
35 1 ill primer OPN2 (10 \M) 

0.4 \il Taq DNA polymerase (5 U/(il) 

2 \xl DNA preparation 

38.1 jil autoclaved distilled water 

40 The reaction mixture is covered with approx. 50 fil of silicone oil 
and subjected to the following temperature program (Thermocycler : 
MWG Biotech Primus HT; MWG Biotech, Germany): 

1 cycle of 180 sec at 95°C 
45 30 cycles of 60 sec at 92°C, 60 sec at 54°C and 3 min at 

72°C 
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1 cycle of 5 min at 72°C. 

The PCR products were sequenced using the "ABI Prism Dye 
Terminator Cycle Sequencing Reaction Kit" (PE Applied Biosystems, 
5 Weiterstadt) 

For the Southern blotting, the DNA was cut with Hindlll or Acc65I 
and subjected to electrophoresis in a 0.8% agarose gel. The DNA 
in the gel was then transferred to the hybridization membrane 

10 'Hybond N' (Amersham f Little Chalfont, UK) by means of capillary 
blotting as described in the manufacturer's instructions. For the 
molecular hybridization, codA- or GUS-specific gene fragments 
were isolated from the starting plasmids (Xbal/Xhol fragment as 
PNE3; Stougaard, 1993, and KpnI/SacI fragment frompGUS23, Puchta 

15 and Hohn, 1991, isolated using the QIAquick Gel Extraction Kit 
[Qiagen, Hilden] ) and labeled with the aid of a "Random Priming 
Labeling Kit" (Megaprime DNA labeling system RPN1607, Amersham, 
Little Chalfont, UK) and [ct- 3 2p]dATP (Amersham, Little Chalfont, 
UK). The hybridization reactions were carried out at 65° C. 

20 

Since in each case 2 genetically linked transgene copies were 
integrated in "the case of lines GU.C.USB 3 and GU.C.USB 7, a 
representative number of plants was regenerated from callus in 
the case of these lines, DNA was obtained and then analyzed per 
25 Southern blot (Table 4). 

In the case of Acc65I, the presence of a GUS-specific 3.7 kb band 
suggests a homologous recombination, while a 4.4 kb band suggests 
an NHEJ event ( "non-homologous end- joining" ; NHEJ) (Fig. 7b and 
30 c; Fig. 13 C) . 

Table 4. Molecular analysis of recombination events by means of 
Southern blots 



35 



Transgenic line 


Calli 


Acc 651 fragment (kb) 
3.7 4.4 Deletion 


GU.C.USB 3 


39 


6 18 15 


GU.C.USB 7 


14 


2 5 7 



40 

Interestingly, the same type of linkage in the two transgene 
copies was found in all cases. In other words, either only 
homologous recombinations or only NHEJ events occurred. In no 
case did both possibilities exist in parallel, i.e. for example a 
45 homologous recombination in the one transgene and an NHEJ event 
in the other. 
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In both lines, PCR analyses were also carried out, and in each 
case three 0.7 kb PCR fragments were selected and sequenced. In 
all three cases, sequencing revealed the functional sequence of 
the B-glucuronidase gene, i.e. the restoration of the gene did 
5 indeed occur by means of homologous recombination. 

When a total of nine 1.4 kb PCR bands of the two lines were 
sequenced, it was furthermore found that these bands indeed 
originated after excision of the codA gene by repair of the two 
10 I-Scel cleavage sites (by "non-homologous end-joining" NHEJ) . 
Again, minor deletions resulted at the I-Scel cleavage site in 
most cases. 

Southern blots demonstrated that, as expected, the sequence 
15 between the I-Scel cleavage sites were eliminated completely in 
the recombinant. No codA-specif ic DNA whatsoever was detected any 
longer in the genome of the regenerated plants (Fig. 13 B and D, 
lanes 5, 6 and 8,9). 

20 Example 5 : 

Various transgenic tobacco plant lines were generated which, 
between the halves of the ^-glucuronidase gene (arrangement as 
described above) also contained cleavage sites for the 

25 abovementioned restriction enzymes in addition to a I-Scel 
cleavage site by means of cloning synthetic oligonucleotides 
(Fig. 10). Seedlings of this tobacco line were inoculated in each 
case in direct comparison with agrobacteria capable of expressing 
either I-Scel or the corresponding enzyme in plant cells. The 

30 resulting calli were then stained histochemically after 2 weeks. 
The results are shown in Table 4. 

The plasmid pGU.C.US.B was cut with I-Scel so that the codA gene 
was excised from the plasmid. The digested DNA was separated by 
35 means of agarose gel electrophoresis, the larger band was excised 
and purified by means of the QIAquick Gel Extraction Kit (Qiagen, 
Hilden) and subsequently ligated and transformed into E. coli. 
The resulting plasmid was then cut with Xbal. 

40 The complementary single-stranded oligonucleotides OPN25 and 
OPN26 were made double-stranded by briefly heating to 92°C and 
subsequent cooling and then subsequently ligated with the 
Xbal-cut plasmid. The resulting SI construct (pSI) contains the 
cleavage sites for I-Scel, I-Cpal, I-CpaII and I-Crel ((see Fig. 

45 10 (A)). 
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OPN15 (SEQ ID NO: 25): 

5'- CTA GTA CAA AAC GTC GTG AGA CAT TTT AAT CTG AAG GTT TGG CAC 
CTC GAT GTC GGC TCA TC-3 ' 

5 OPN16 (SEQ ID NO: 26): 

5' -CTA GGA TGA GCC GTC ATC GAG GTG CCA AAC CTT CAG ATT AAA ATG 
TCT CAC GAC GTT TTG TA-3 ' 

The complementary single-stranded oligonucleotides OPN27 and 
10 OPN28 were made double-stranded by briefly heating to 92°C and 
subsequent cooling and then subsequently ligated with the 
Xbal-cut plasmid. The resulting SD construct (pSD) contains the 
cleavage sites for I-Scel and I-Chul (see Fig, 10 (B)). 

15 OPN17 (SEQ ID NO: 27): 

5' -CTA GTC CGA AAA CGC CGT GAG ACA TAT TGG TTA CGA TCC TAA GGT 
AGC GAA ATT CAC CCG GTA ACT CTG TGC CAG-3 ' 

OPN18 (SEQ ID NO: 28) : 
20 5' -CTA GCT GGC ACA GAG TTA CCG GGT GAA TTT CGC TAC CTT AGG ATC 
GTA ACC AAT ATG TCT CAC GGC GTT TTC GGA- 3 ' 

Transgenic tobacco plants with the two constructs were generated 
as described further above by means of agrobacterium 
25 transformation. Lines which only contained transgenic sequences 
at one locus were used for the further experiments. These lines 
were determined by the 3:1 segregation into 

phosphinothricin-resistant and nonresistant plants. The selfed 
seedlings were then inoculated with agrobacterial strains which 

30 comprised one of the four constructs for expressing the 

restriction endonucleases or, as the vector control, the plasmid 
BinAR or, as the positive control, a 1:1 mixture of BinAR and 
CISce-I. The inoculations were carried out as described above 
(Puchta H (1999) Methods Mol. Biol. 113:447-451), and for 

35 selection purposes the seedlings were grown over several weeks on 
MS medium supplemented with 100 ng kanamycin per ml, which also 
contained BAP and NAA, vancomycin and cefotaxin (concentrations 
as above). The resulting calli were then subjected to 
histochemical ^-glucuronidase staining as described above. 

40 

All four of the tested restriction enzymes were capable of 
inducing homologous recombination in the same order of magnitude 
as I-Scel (which was employed here in a coinoculation with the 
selection vector pBinAR [AR]) (Table 5). This demonstrates that 
45 homologous recombination can be induced efficiently when using 
any restriction endonucleases. 
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Table 5. Induction of homologous recombination in plants by means 
of various endonuc leases I-Crel, I-Cpal, I-CpaII and I-Chul. 
[ Sectors /calli] refers to the number of areas stained blue in the 
resistant calli. 



5 



10 



15 



Transgenic line 


Enzyme 


Sectors /calli 


Ratio 


SI5 


T CnoT /AT? 

x-bcei / AK 




1.35 




I-Crel 


77/50 


0.54 




I-CpaII 


51/50 


1.02 


SI2 


I-Scel/AR 


8/9 


0.89 




I-Crel 


40/18 


2.22 




I-CpaII 


9/20 


0.45 


SI2 


I-Cpal 


144/106 


1.36 


SD2 


I-Chul 


166/100 


1.66 



20 



25 



30 



35 



40 



45 
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SEQUENCE LISTING 

<110> SunGene GmbH & Co. KGaA 

<120> Systems and Methods for eliminating 

nucleic acid sequences from the chromosomal DNA 
of eukaryotic organisms 

<130> NAE502_2001 

<140> 
<141> 

<160> 30 

<170> Patentln Ver. 2.1 

<210> 1 
<211> 788 
<212> DNA 

<213> Saccharomyces cerevisiae 

<220> 

<221> CDS 

<222> (62).. (766) 

<223> open reading frame coding for I-Scel 
<400> 1 

ggatccagta ctgtacctag aatacaaaga agaggaagaa gaaacctcta cagaagaagt 60 

g atg aaa aac ate aaa aaa aac cag gta atg aac ctg ggt ccg aac tct 109 
Met Lys Asn lie Lys Lys Asn Gin Val Met Asn Leu Gly Pro Asn Ser 
1 5 10 15 

aaa ctg ctg aaa gaa tac aaa tec cag ctg ate gaa ctg aac ate gaa 157 
Lys Leu Leu Lys Glu Tyr Lys Ser Gin Leu lie Glu Leu Asn lie Glu 
20 25 30 

cag ttc gaa gca ggt ate ggt ctg ate ctg ggt gat get tac ate cgt 205. 
Gin Phe Glu Ala Gly lie Gly Leu lie Leu Gly Asp Ala Tyr lie Arg 
35 40 45 

tct cgt gat gaa ggt aaa ace tac tgt atg cag ttc gag tgg aaa aac 253 
Ser Arg Asp Glu Gly Lys Thr Tyr Cys Met Gin Phe Glu Trp Lys Asn 
50 55 60 

aaa gca tac atg gac cac gta tgt ctg ctg tac gat cag tgg gta ctg 301 
Lys Ala Tyr Met Asp His Val Cys Leu Leu Tyr Asp Gin Trp Val Leu 
65 70 75 * 80 

tec ccg ccg cac aaa aaa gaa cgt gtt aac cac ctg ggt aac ctg gta 349 
Ser Pro Pro His Lys Lys Glu Arg Val Asn His Leu Gly Asn Leu Val 
85 90 95 

ate "ace tgg ggc gee cag act ttc aaa cac caa get ttc aac aaa ctg 397 
lie Thr Trp Gly Ala Gin Thr Phe Lys His Gin Ala Phe Asn Lys Leu 
100 105 110 

get age ctg ttc ate gtt aac aac aaa aaa acc ate ccg aac aac ctg 445 
Ala Ser Leu Phe lie Val Asn Asn Lys Lys Thr lie Pro Asn Asn Leu 
115 120 125 

gtt gaa aac tac ctg acc ccg atg tct ctg gca tac tgg ttc atg gat 493 
Val Glu Asn Tyr Leu Thr Pro Met Ser Leu Ala Tyr Trp Phe Met Asp 
130 135 140 

gat ggt ggt aaa tgg gat tac aac aaa aac tct acc aac aaa teg ate 541 
Asp Gly Gly Lys Trp Asp Tyr Asn Lys Asn Ser Thr Asn Lys Ser lie 
145 150 155 160 



0817/00023 



CA 02451492 2003-12-18 



gta ctg aac acc cag tct ttc act ttc gaa gaa gta gaa tac ctg gtt 589 
Val Leu Asn Thr Gin Ser Phe Thr Phe Glu Glu Val Glu Tyr Leu Val 
165 170 175 

aag ggt ctg cgt aac aaa ttc caa ctg aac tgt tac eta aaa ate aac 637 
Lys Gly Leu Arg Asn Lys Phe Gin Leu Asn Cys Tyr Leu Lys lie Asn 
180 185 190 

aaa aac aaa ccg ate ate tac ate gat tct atg tct tac ctg ate ttc 685 
Lys Asn Lys Pro lie lie Tyr lie Asp Ser Met Ser Tyr Leu lie Phe 
195 200 205 

tac aac ctg ate aaa ccg tac ctg ate ccg cag atg atg tac aaa ctg 733 
Tyr Asn Leu lie Lys Pro Tyr Leu lie Pro Gin Met Met Tyr Lys Leu 
210 215 220 

ccg aac act ate tec tec gaa act ttc ctg aaa taataagtcg agtactggat 786 

Pro Asn Thr lie Ser Ser Glu Thr Phe Leu Lys 
225 230 235 

cc 788 

<210> 2 
<211> 235 
<212> PRT 

<213> Saccharomyces cerevisiae 
<400> 2 

Met Lys Asn lie Lys Lys Asn Gin Val Met Asn Leu Gly Pro Asn Ser 
1 5 10 15 

Lys Leu Leu Lys Glu Tyr Lys Ser Gin Leu lie Glu Leu Asn lie Glu 
20 25 30 

Gin Phe Glu Ala Gly lie Gly Leu He Leu Gly Asp Ala Tyr He Arg 
35 40 45 

Ser Arg Asp Glu Gly Lys Thr Tyr Cys Met Gin Phe Glu Trp Lys Asn 
50 55 60 

Lys Ala Tyr Met Asp His Val Cys Leu Leu Tyr Asp Gin Trp Val Leu 
65 70 75 80 

Ser Pro Pro His Lys Lys Glu Arg Val Asn His Leu Gly Asn Leu Val 
85 90 95 

He Thr Trp Gly Ala Gin Thr Phe Lys His Gin Ala Phe Asn Lys Leu 
100 105 110 

Ala Ser Leu Phe lie Val Asn Asn Lys Lys Thr He Pro Asn Asn Leu 
115 120 125 

Val Glu Asn Tyr Leu Thr Pro Met Ser Leu Ala Tyr Trp Phe Met Asp 
130 135 140 

Asp Gly Gly Lys Trp Asp Tyr Asn Lys Asn Ser Thr Asn Lys Ser He 
145 150 155 160 

Val Leu Asn Thr Gin Ser Phe Thr Phe Glu Glu Val Glu Tyr Leu Val 
165 170 175 

Lys Gly Leu Arg Asn Lys Phe Gin Leu Asn Cys Tyr Leu Lys He Asn 
180 185 190 

Lys Asn Lys Pro He He Tyr lie Asp Ser Met Ser Tyr Leu He Phe 
195 200 ^ 205 

Tyr Asn Leu He Lys Pro Tyr Leu He Pro Gin Met Met Tyr Lys Leu 
210 215 220 
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pro Asn Thr He Ser Ser Glu Thr Phe Leu Lys 
225 230 235 

<210> 3 
<211> 746 
<212> DNA 

<213> Chlamydomonas applanata 

<220> 

<221> CDS 

<222> (54) ..(737) 

<223> open reading frame of I-Chul with nuclear location 
signal 

<220> 

<221> misc_feature 
<222> (54). .(83) 

<223> coding for nuclear location signal 
<400> 3 

ctcgagtacc tagaatacaa agaagaggaa gaagaaactc tatagaagaa gcc atg 56 

Met 
1 

ggt cca aag aaa aag aga aag gtt ate atg tea tta aca caa caa caa 104 
Gly Pro Lys Lys Lys Arg Lys Val He Met Ser Leu Thr Gin Gin Gin 
5 10 15 

aaa gac tta att ttc' gga tct eta ctg ggt gat gga aat tta caa act 152 
Lys Asp Leu He Phe Gly Ser Leu Leu Gly Asp Gly Asn Leu Gin Thr 

20 25 30 

ggt tea gta ggt agg act tgg cgc tat cga gcg etc cat aaa agt gag 200 
Gly Ser Val Gly Arg Thr Trp Arg Tyr Arg Ala Leu His Lys Ser Glu 
35 " 40 45 

cat cag aca tac tta ttt cat aag tat gaa ate tta aag ccg ctt tgt 248. 
His Gin Thr Tyr Leu Phe His Lys Tyr Glu He Leu Lys Pro Leu Cys 
50 55 60 65 

ggc gaa aat act etc cca aca gaa agt ata gtg ttc gac gaa aga aca 296 
Gly Glu Asn Thr Leu Pro Thr Glu Ser He Val Phe Asp Glu Arg Thr 

70 75 80 

aac aag gag gtt aaa cgt tgg ttt ttc aac aca tta acc aat cct tec 344 
Asn Lys Glu Val Lys Arg Trp Phe Phe Asn Thr Leu Thr Asn Pro Ser 

85 90 95 

tta aaa ttc ttc gca gac atg ttc tac aca tat gac caa aac aca caa 392 
Leu Lys Phe Phe Ala Asp Met Phe Tyr Thr Tyr Asp Gin Asn Thr Gin 

100 105 110 

aaa tgg gtt aaa gat gta cct gta aag gtt caa aca ttc tta act cct 440 
Lys Trp Val Lys Asp Val Pro Val Lys Val Gin Thr Phe Leu Thr Pro 

115 " 120 125 

caa get tta gca tac ttt tat ata gac gat gga gcg tta aaa tgg ctt 488 
Gin Ala Leu Ala Tyr Phe Tyr He Asp Asp Gly Ala Leu Lys Trp Leu 
130 135 140 145 

aat aag tct aac get atg caa att tgt act gaa agt ttc agt caa ggg 536 
Asn Lys Ser Asn Ala Met Gin lie Cys Thr Glu Ser Phe Ser Gin Gly 

150 155 160 

ggc acg att egg ate caa aaa gca eta aaa acg etc tat aat att gat 584 
Gly Thr He Arg He Gin Lys Ala Leu Lys Thr Leu Tyr Asn He Asp 
165 170 175 
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aca acg ttg aca aaa aaa act eta caa gac ggc aga att ggc tat cgt 632 
Thr Thr Leu Thr Lys Lys Thr Leu Gin Asp Gly Arg lie Gly Tyr Arg 
180 185 190 

ata get att cct.gaa gee agt age ggt get ttt cgt gaa gtc att aaa 680 
lie Ala He Pro Glu Ala Ser Ser Gly Ala Phe Arg Glu Val He Lys 
195 200 205 

cct ttt eta gtt gat tgt atg aga tac aaa gtt tct gat ggc aat aaa 728 
Pro Phe Leu Val Asp Cys Met Arg Tyr Lys Val Ser Asp Gly Asn Lys 
210 215 220 225 

ggc cac ctt tagctcgag 746 
Gly His Leu 

<210> 4 
<211> 228 
<212> PRT 

<213> Chlamydomonas applanata 
<400> 4 

Met Gly Pro Lys Lys Lys Arg Lys Val He Met Ser Leu Thr Gin Gin 
1 5 10 15 

Gin Lys Asp Leu He Phe Gly Ser Leu Leu Gly Asp Gly Asn Leu Gin 
20 25 30 

Thr Gly Ser Val Gly Arg Thr Trp Arg Tyr Arg Ala Leu His Lys Ser 
35 40 45 

Glu His Gin Thr Tyr Leu Phe His Lys Tyr Glu lie Leu Lys Pro Leu 
50 55 60 . 

Cys Gly Glu Asn Thr Leu Pro Thr Glu Ser He Val Phe Asp Glu Arg 
65 70 75 80 

Thr Asn Lys Glu Val Lys Arg Trp Phe Phe Asn Thr Leu Thr Asn Pro 
85 90 95 

Ser Leu Lys Phe Phe Ala Asp Met Phe Tyr Thr Tyr Asp Gin Asn Thr 
100 105 110 

Gin Lys Trp Val Lys Asp Val Pro Val Lys Val Gin Thr Phe Leu Thr 
115 120 125 

Pro Gin Ala Leu Ala Tyr Phe Tyr He Asp Asp Gly Ala Leu Lys Trp 
130 135 140 

Leu Asn Lys Ser Asn Ala Met Gin He Cys Thr Glu Ser Phe Ser Gin 
145 . 150 155 160 

Gly Gly Thr He Arg He Gin Lys Ala Leu Lys Thr Leu Tyr Asn He 
165 170 175 

Asp Thr Thr Leu Thr Lys Lys Thr Leu Gin Asp Gly Arg He Gly Tyr 
180 185 190 . 

Arg He Ala He Pro Glu Ala Ser Ser Gly Ala Phe Arg Glu Val lie 
195 200 205 

Lys Pro Phe Leu Val Asp Cys Met Arg Tyr Lys Val Ser Asp Gly Asn 
210 215 220 

Lys Gly His Leu 
225 



<210> 5 
<211> 582 
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<212> DNA 

<213> Chlamydomonas reinhardtii 

<220> 

<221> CDS 

<222> (55). .(573) 

<223> openreading frame coding for I-Crel with nuclear 
location signal 

<220> 

<221> misc_feature 
<222> (55). .(84) 

<223> coding for nuclear location signal 
<400> 5 

ctcgagtacc tagaatacaa agaagaggaa gagaaacctc taccagaaga agcc atg 57 

Met 
1 

ggt cca aag aaa aag aga aag gtt ate atg aat aca aaa tat aat aaa 105 
Gly Pro Lys Lys Lys Arg Lys Val He Met Asn Thr Lys Tyr Asn Lys 
5 10 15 

gag ttc tta etc tac tta gca ggg ttt gta gac ggt gac ggt age ata 153 
Glu Phe Leu Leu Tyr Leu Ala Gly Phe Val Asp Gly Asp Gly Ser lie 
20 25 30 

ate get caa att aag cct aat cag tct tat aaa ttt aag cat cag eta 201 
He Ala Gin He Lys Pro Asn Gin Ser Tyr Lys Phe Lys His Gin Leu 

35 40 45 

tea etc gcg ttc caa gtc acg caa aag aca cag aga cgt tgg ttt tta 249 
Ser Leu Ala Phe Gin Val Thr Gin Lys Thr Gin Arg Arg Trp Phe Leu 
50 55 60 65 

gac aaa tta gtg gat gaa att ggg gtt ggt tat gta aga gat agg ggt 297 
Asp Lys Leu Val Asp Glu He Gly Val Gly Tyr Val Arg Asp Arg Gly 

70 75 80 

age gtt teg gat tat att eta age gaa ate aag cct ttg cat aat ttt 345 
Ser Val Ser Asp Tyr He Leu Ser Glu He Lys Pro Leu His Asn Phe 
85 * 90 95 

tta aca caa eta caa cct ttt eta aaa eta aaa caa aaa caa gca aat 393 
Leu Thr Gin Leu Gin Pro Phe Leu Lys Leu Lys Gin Lys Gin Ala Asn 
100 105 110 

tta gtt tta aaa att att gaa caa ctt ccg tea gca aaa gaa tec ccg 441 

Leu Val Leu Lys He He Glu Gin Leu Pro Ser Ala Lys Glu Ser Pro 

115 120 125 

gac aaa ttc tta gaa gtt tgt aca tgg gtg gat caa att gca get ctg 489 

Asp Lys Phe Leu Glu Val Cys Thr Trp Val Asp Gin He Ala Ala Leu 
130 135 140 145 

aat gat teg aag acg cgt aaa aca act tct gaa acc gtt cgt get gtg 537 

Asn Asp Ser Lys Thr Arg Lys Thr Thr Ser Glu Thr Val Arg Ala Val 
150 155 160 

eta gac agt tta agt gaa aaa aag aaa teg tec ccg tagctcgag 582 
Leu Asp Ser Leu Ser Glu Lys Lys Lys Ser Ser Pro 
165 170 

<210> 6 
<211> 173 
<212> PRT 

<213> Chlamydomonas reinhardtii 
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<400> 6 

Met Gly Pro Lys Lys Lys Arg Lys Val lie Met Asn Thr Lys Tyr Asn 
1 " 5 . 10 15 

Lys Glu Phe Leu Leu Tyr Leu Ala Gly Phe Val Asp Gly Asp Gly Ser 
20 25 30 

lie lie Ala Gin He Lys Pro Asn Gin Ser Tyr Lys Phe Lys His Gin 
35 40 45 

Leu Ser Leu Ala Phe Gin Val Thr Gin Lys Thr Gin Arg Arg Trp Phe 
50 55 60 

Leu Asp Lys Leu Val Asp Glu He Gly Val Gly Tyr Val Arg Asp Arg 
65 70 75 80 

Gly Ser Val Ser Asp Tyr He Leu Ser Glu lie Lys Pro Leu His Asn 
85 90 95 

Phe Leu Thr Gin Leu Gin Pro Phe Leu Lys Leu Lys Gin Lys Gin Ala 
100 105 110 

Asn Leu Val Leu Lys lie He Glu Gin Leu Pro Ser Ala Lys Glu Ser 
115 " 120 125 

Pro Asp Lys Phe Leu Glu Val Cys Thr Trp Val Asp Gin He Ala Ala 
130 135 140 

Leu Asn Asp Ser Lys Thr Arg Lys Thr Thr Ser Glu Thr Val Arg Ala 
145 150 155 160 

Val Leu Asp Ser Leu Ser Glu Lys Lys Lys Ser Ser Pro 
165 170 



<210> 7 
<211> 546 
<212> DNA 

<213> Chlamydomonas segnis 

<220> 

<221> CDS 

<222> (52).. (537) 

<223> open readings frame coding for I-Cpal with nuclear 
location signal 

<220> 

<221> misc_feature 
<222> (52).. (81) 

<223> coding for nuclear location signal 
<400> 7 

ctcgagtacc tagaatacaa gaagagaaga agaacctcta cagaagaagc c atg ggt 57 

Met Gly 
1 

cca aag aaa aag aga aag gtt ate atg gac att aat cct caa tgg att 105 
Pro Lys Lys Lys Arg Lys Val He Met Asp He Asn Pro Gin Trp He 
5 ~ 10 15 

aca ggt ttc gta gat ggg gaa ggt tgt ttt agt gta agt att ctt aga 153 
Thr Gly Phe Val Asp Gly Glu Gly Cys Phe Ser Val Ser He Leu Arg 
20 25 30 

aat aat teg ttg cgc tat ggc cat cag ctt caa cca gaa ttc gta gtg 201 
Asn Asn Ser Leu Arg Tyr Gly His Gin Leu Gin Pro Glu Phe Val Val 
35 40 45 50 
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acc caa cat aaa tta gat gca aat gtt tta tat gca tta. aaa gac tac 249 
Thr Gin His Lys Leu Asp Ala Asn Val Leu Tyr Ala Leu Lys Asp Tyr 

55 60 65 

ttt aaa gtt gga tea gtc gtt gtg aat cat ggg gaa egg ctt tgc tat 297 
Phe Lys Val Gly Ser Val Val Val Asn His Gly Glu Arg Leu Cys Tyr 
70 75 80 

aaa gtc aaa aat att gat cac ttt ata acc gtc att ata cca ttt ttc 345 
Lys Val Lys Asn lie Asp His Phe lie Thr Val lie lie Pro Phe Phe 
85 90 95 

gaa aaa cat gag eta aaa aca aaa aga aga att gaa ttt ctt cga ttt 393 
Glu Lys His Glu Leu Lys Thr Lys Arg Arg lie Glu Phe Leu Arg Phe 
100 105 110 

cga aaa ate tgc ttg ctg tta aaa gca ggt aga cat tta gaa teg cag 441 
Arg Lys lie Cys Leu Leu Leu Lys Ala Gly Arg His Leu Glu Ser Gin 
115 120 125 130 

gaa gga ttc gag aaa gtg ttg gat tta gca aaa aaa etc cgt ate aat 489 
Glu Gly Phe Glu Lys Val Leu Asp Leu Ala Lys Lys Leu Arg lie Asn 
135 140 145 

gag aaa aac tac cag gaa tct ate aaa cgt ttt gaa gaa act ggc gag 537 
Glu Lys Asn Tyr Gin Glu Ser lie Lys Arg Phe Glu Glu Thr Gly Glu 
150 155 160 

taactcgag 546 

<210> 8 
<211> 162 
<212> PRT 

<213> Chlamydomonas segnis 
<400> 8 

Met Gly Pro Lys Lys Lys Arg Lys Val lie Met Asp lie Asn Pro Gin 
1 5 10 15 

Trp lie Thr Gly Phe Val Asp Gly Glu Gly Cys Phe Ser Val Ser lie 
20 25 30 

Leu Arg Asn Asn Ser Leu Arg Tyr Gly His Gin Leu Gin Pro Glu Phe 
35 40 45 

Val Val Thr Gin His Lys Leu Asp Ala Asn Val Leu Tyr Ala Leu Lys 
50 55 . 60 

Asp Tyr Phe Lys Val Gly Ser Val Val Val Asn His Gly Glu Arg Leu 
65 70 75 80 

Cys Tyr Lys Val Lys Asn lie Asp His Phe lie Thr Val lie lie Pro 

85 90 95 

Phe Phe Glu Lys His Glu Leu Lys Thr Lys Arg Arg lie Glu Phe Leu 
100 105 110 

Arg Phe Arg Lys lie Cys Leu Leu Leu Lys Ala Gly Arg His Leu Glu 
115 120 125 

Ser Gin Glu Gly Phe Glu Lys Val Leu Asp Leu Ala Lys Lys Leu Arg 
130 135 140 

lie Asn Glu Lys Asn Tyr Gin Glu Ser lie Lys Arg Phe Glu Glu Thr 
145 150 155 160 

Gly Glu 
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<210> 9 
<211> 793 
<212> DNA 

<213> Chlamydoraonas segnis 

<220> 

<221> CDS 

<222> (53). .(784) 

<223> open reading frame coding for I-CpaII with nuclear 
location signal 

<220> 

<221> misc_feature 
<222> (53).. (82) 

<223> coding for nuclear location signal 
<400> 9 

ctcgagtacc tagaaacaaa gaagaggaag aagaaactct acagaagaag cc atg ggt 58 

Met Gly 
1 

cca aag aaa aag aga aag gtt ate atg acc gat tct aaa tct aga aac 106 
Pro Lys Lys Lys Arg Lys Val lie Met Thr Asp Ser Lys Ser Arg Asn 
5 10 15 

aac aat aat ttt tta age aat aat ctt tta cct ttg acc gat gac gag 154 
Asn Asn Asn Phe Leu- Ser Asn Asn Leu Leu Pro Leu Thr Asp Asp Glu 
20 25 30 

aag get tta att gcg ggg aca ctt tta ggg gat get cat att caa aag 202 
Lys Ala Leu lie Ala Gly Thr Leu Leu Gly Asp Ala His lie Gin Lys 
35 40 45 50 

cgt ggt gat age tat agg eta aaa ata get cat ggc ttg gat cat gaa 250 
Arg Gly Asp Ser Tyr Arg Leu Lys lie Ala His Gly Leu Asp His Glu 
55 60 65 

gag ctt gtc gtc tgg aag tat aac cgt tta ate agg ttg tgt caa aca 298' 
Glu Leu Val Val Trp Lys Tyr Asn Arg Leu lie Arg Leu Cys Gin Thr 
70 75 80 

aca caa ccc cca agg gtg gaa acc tac tea aca aag tta aag tct ggc 346 
Thr Gin Pro Pro Arg Val Glu Thr Tyr Ser Thr Lys Leu Lys Ser Gly 
85 90 95 

gta ttg cct caa ggg gtt gtt ttc tat acc teg tec gga aag tat tta 394 
Val Leu Pro Gin Gly Val Val Phe Tyr Thr Ser Ser Gly Lys Tyr Leu 
100 * 105 110 

aaa gag act tat gac ctt ttt tat aaa caa act gca gac ggt egg agg 442 
Lys Glu Thr Tyr Asp Leu Phe Tyr Lys Gin Thr Ala Asp Gly Arg Arg 
115 120 125 130 

gta aaa aca ata aca cag gag ttg ate gac agt tta ccc aag cat cca 490 
Val Lys Thr lie Thr Gin Glu Leu lie Asp Ser Leu Pro Lys His Pro 
135 140 145 

ttg gtc tta gca gec ttt ttt atg gac gat ggt agt gtt egg tec gac 538 
Leu Val Leu Ala Ala Phe Phe Met Asp Asp Gly Ser Val Arg Ser Asp 
150 155 160 

tgt tat tea gga aag att gca acg cca ggg ttt get ggt aaa gaa gaa 586 
Cys Tyr Ser Gly Lys lie Ala Thr Pro Gly Phe Ala Gly Lys Glu Glu 
165 170 175 
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age cag ttg ttg tgt aac tat eta cac agt tgg gat gtt caa gca aac 634 
Ser Gin Leu Leu Cys Asn Tyr Leu His Ser Trp Asp Val Gin Ala Asn 
180 185 " 190 

gta gtt get cat aaa aaa gca aac aat cag tat tac att ggg etc cca 682 
Val Val Ala His Lys Lys Ala Asn Asn Gin Tyr Tyr lie Gly Leu Pro 
195 200 205 210 

gca aaa aca ttt ggt cgc ttt att aac att att gaa cec tac gtt aga 730 
Ala Lys Thr Phe Gly Arg Phe lie Asn lie lie Glu Pro Tyr Val Arg 
215 220 225 

gaa gtt cct get tta tgt tat aaa tta aac gaa tea aga aaa ccc cgt 778 
Glu Val Pro Ala Leu Cys Tyr Lys Leu Asn Glu Ser Arg Lys Pro Arg 
230 235 240 

aac gac tgactcgag 793 
Asn Asp 

<210> 10 
<211> 244 
<212> PRT 

<213> Chlamydomonas segnis 
<400> 10 

Met Gly Pro Lys Lys Lys Arg Lys Val lie Met Thr Asp Ser Lys Ser 
1 5 10 15 

Arg Asn Asn Asn Asn Phe Leu Ser Asn Asn Leu Leu Pro Leu Thr Asp 
20 25 30 

Asp Glu Lys Ala Leu lie Ala Gly Thr Leu Leu Gly Asp Ala His lie 
35 40 45 

Gin Lys Arg Gly Asp Ser Tyr Arg Leu Lys lie Ala His Gly Leu Asp 
50 55 60 

His Glu Glu Leu Val Val Trp Lys Tyr Asn Arg Leu lie Arg Leu Cys 
65 70 75 80 

Gin Thr Thr Gin Pro Pro Arg Val Glu Thr Tyr Ser Thr Lys Leu Lys 

85 90 95 

Ser Gly Val Leu Pro Gin Gly Val Val Phe Tyr Thr Ser Ser Gly Lys 
100 105 110 

Tyr Leu Lys Glu Thr Tyr Asp Leu Phe Tyr Lys Gin Thr Ala Asp Gly 
115 120 125 

Arg Arg Val Lys Thr lie Thr Gin Glu Leu lie Asp Ser Leu Pro Lys 
130 135 140 

His Pro Leu Val Leu Ala Ala Phe Phe Met Asp Asp Gly Ser Val Arg 
145 150 155 160 

Ser Asp Cys Tyr Ser Gly Lys lie Ala Thr Pro Gly Phe Ala Gly Lys 
165 170 175 

Glu Glu Ser Gin Leu Leu Cys Asn Tyr Leu His Ser Trp Asp Val Gin 
180 185 190 

Ala Asn Val Val Ala His Lys Lys Ala Asn Asn Gin Tyr Tyr lie Gly 
195 200 205 

Leu Pro Ala Lys Thr Phe Gly Arg Phe lie Asn lie lie Glu Pro Tyr 
210 215 220 

Val Arg Glu val Pro Ala Leu Cys Tyr Lys Leu Asn Glu Ser Arg Lys 
225 230 235 240 
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<210> 11 
<211> 35 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> Description of the artificial sequence: 
oligonucleotide primer 

<400> 11 

cggctcgagc tacggggacg atttcttttt ttcac 35 

<210> 12 
<211> 120 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> Description of the artificial sequence: 
oligonucleotide primer 

<400> 12 

cggctcgagt acctagaata caaagaagag gaagaagaaa cctctacaga agaagccatg 60 
ggtccaaaga aaaagagaaa ggttatcatg aatacaaaat ataataaaga gttcttactc 120 

<210> 13 
<211> 116 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> Description of the artificial sequence: 
oligonucleotide primer 

<400> 13 

cggctcgagt acctagaata caaagaagag gaagaagaaa cctctacaga agaagccatg 60 
ggtccaaaga aaaagagaaa ggttatcatg gacattaatc ctcaatggat tacagg 116 

<210> 14 
<211> 33 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> Description of the artificial sequence: 
oligonucleotide primer 

<400> 14 

cggctcgagt tactcgccag tttcttcaaa acg 33 

<210> 15 
<211> 113 
<212>. DNA 

<213> Artificial sequence 
<220> 

<223> Description of the artificial sequence: 
oligonucleotide primer 

<400> 15 

cggctcgagt acctagaata caaagaagag gaagaagaaa cctctacaga agaagccatg 60 
ggtccaaaga aaaagagaaa ggttatcatg accgattcta aatctagaaa caa 113 
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<210> 16 
<211> 35 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> Description of the artificial sequence: 
oligonucleotide primer 

<400> 16 

cggctcgagc taaaggtggc ctttattgcc atcag 35 

<210> 17 
<211> 114 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> Description of the artificial sequence: 
oligonucleotide primer 

<400> 17 

cggctcgagt acctagaata caaagaagag gaagaagaaa cctctacaga agaagccatg 60 
ggtccaaaga aaaagagaaa ggttatcatg tcattaacac aacaacaaaa agac 114 

<210> 18 
<211> 35 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> Description of the artificial sequence: 
oligonucleotide primer 

<400> 18 

cggctcgagc taaaggtggc ctttattgcc atcag 35 

<210> 19 
<211> 66 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> Description of the artificial sequence: 
oligonucleotide primer 

<400> 19 

cggctctaga gcggccgcct agggataaca gggtaataga atcccacaaa aatctgagct 60 
taacag 66 

<210> 20 
<211> 65 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> Description of the artificial sequence: 
oligonucleotide primer 

<400> 20 

cggctctaga ctattaccct gttatcccta ggcccgatct agtaacatag atgacaccgc 60 
gcgcg 65 

<210> 21 

<211> 34 

<212> DNA 

<213> Artificial sequence 
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<220> ' 

<223> Description of the artificial sequence: 
oligonucleotide primer 

<400> 21 

cggaagcttc gtcaccaatc ccaattcgat ctac 34 

<210> 22 
<211> 34 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> Description of the artificial sequence: 
oligonucleotide, primer 

<400> 22 

cggaagcttc cacttgcaaa gtcccgctag tgcc 34 

<210> 23 
<211> 34 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> Description of the artificial sequence: 
oligonucleotide primer 

<400> 23 

cggaagcttc gtcaccaatc ccaattcgat ctac 34 

<210> 24 
<211> 34 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> Description of the artificial sequence: 
oligonucleotide primer 

<400> 24 

cggaagcttc cacttgcaaa gtcccgctag tgcc 34 

<210> 25 
<211> 62 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> Description of the artificial sequence: 
oligonucleotide primer 

. <400> 25 

ctagtacaaa acgtcgtgag acattttaat ctgaaggttt ggcacctcga tgtcggctca 60 

tc 62 

<210> 2 6 

<211> 62 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Description of the artificial sequence: 
oligonucleotide primer 

<400> 26 

ctaggatgag ccgtcatcga ggtgccaaac cttcagatta aaatgtctca cgacgttttg 60 
ta 62 
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<210> 27 
<211> 75 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> Description of the artificial sequence: 
oligonucleotide primer 

<400> 27 

ctagtccgaa aacgccgtga gacatattgg ttacgatcct aaggtagcga aattcacccg 60 
gtaactctgt gccag 75 

<210> 28 
<211> 75 
<212> DNA 

<213> Artificial sequence 
<22p> 

<223> Description of the artificial sequence: 
oligonucleotide primer 

<400> 28 

ctagctggca cagagttacc gggtgaattt cgctacctta ggatcgtaac caatatgtct 60 
cacggcgttt tcgga 75 

<210> 29 
<211> 7 
<212> PRT 

<213> Artificial sequence 
<220> 

<223> Description of the artificial sequence: nuclear 
location sequence 

<400> 29 

Pro Lys Thr Lys Arg Lys Val 
1 5 

<210> 30 
<211> 7 
<212> PRT 

<213> Artificial sequence 
<220> 

<223> Description of the artificial sequence: nuclear 
location sequence 

<400> 30 

Pro Lys Lys Lys Arg Lys Val 
1 5 
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We claim: 

1. A recombination system which comprises , in a eukaryotic cell 
5 or organism, 

I) a transgenic recombination construct inserted into the 
chromosomal DNA of a eukaryotic organism comprising a 
sequence consisting, in the 5 '/3 '-direction, of 



10 



20 



al) a first homology sequence A and 



bl) at least one recognition sequence for the 

site-directed induction of DNA double-strand breaks 
15 and 

a2) a second homology sequence B f the homology sequences 
A and B having a sufficient length and sufficient 
homology in order to ensure homologous recombination, 



together with 



II) ah enzyme suitable for inducing DNA double-strand breaks 
at the recognition sequence (b) for the site-directed 
25 induction of DNA double-strand breaks or a nucleic acid 

sequence encoding an enzyme suitable for inducing DNA 
double-strand breaks at the recognition sequence (b) . 

2. A recombination system as claimed in claim 1, wherein the 
30 recombination construct is constructed as follows: 

al) a first homology sequence A and 

bl) a recognition sequence for the site-directed induction of 
35 DNA double-strand breaks and 

c) a further nucleic acid sequence and 

a2) a second homology sequence B, the homology sequences A 
40 and B having a sufficient length and sufficient homology 

in order to ensure homologous recombination. 



45 



Fig. + Sequ 
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3. A recombination system as claimed in claim 1 or 2, wherein 
. the recombination construct is constructed as follows: 

al) a first homology sequence A and 

bl) a first recognition sequence for the site-directed 
induction of DNA doubLe-strand breaks and 



10 



25 



35 



c) a further nucleic acid sequence and 

b2) a second recognition sequence for the site-directed 
induction of DNA double-strand breaks and 



al) [sic] a second homology sequence B, the homology 
15 sequences A and B having a sufficient length and 

sufficient homology in order to ensure homologous 
recombination. 

4. A recombination system as claimed in any of claims 1 to 3, 
20 wherein the recombination construct or the further nucleic 
acid sequence encompasses at least one of the elements 
selected from the group consisting of 



i) positive selection markers 

ii) negative selection markers 

iii) reporter genes 

30 iv) replication origins 

v) multiple cloning regions 

vi) border sequences for Agrobacterium trans fee tion 



vii) sequences which enable homologous recombination or 
insertion into the genome of a host organism 



viii) expression cassette for an enzyme suitable for inducing 
40 DNA double-strand breaks at the recognition sequence 

for the site-directed induction of DNA double-strand 
breaks 

5. A recombination system as claimed in any of claims 1 to 4, 
45 which comprises that an enzyme suitable for inducing DNA 
double-strand breaks at the recognition sequence for the 
site-directed induction of DNA double-strand breaks is 
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selected from the group consisting of restriction 
endonucleases, homing endonucleases, group II intron 
endonucleases ^ recombinases , transposases and chimeric 
nucleases . 

5 

6. A recombination system as claimed in any of claims 1 to 5, 
wherein the enzyme suitable for inducing DNA double-strand 
breaks at the recognition sequence for the site-directed 
induction of DNA double-strand breaks is selected from the 

10 group of the homing endonucleases consisting of F-Scel, 

F-SceII f F-Suvl, F-TevI, F-Tevll, I-Amal, I-Anil, I-Ceul, 
I-CeuAIIP, I-Chul, I-Cmoel, I-Cpal, I-CpaII, I-rCrel, 
1-CrepsbIP, I-CrepsbIIP, I-CrepsbIIIP, I-CrepsbIVP, I-Csml r 
I-Cvul, I-CvuAIP, I-Ddil, I-DdiII, I-Dirl, I-Dmol, I-Hmul, 

15 I-Hmull, I-HspNIP, I-Llal, I-Msol, I-Naal, I-NanI, I-NclIP, 
I-NgrIP, I-NitI, I-Njal, I-Nsp236lP, I-Pakl r I-PboIP, 
I-PcuIP, I-PcuAI, I-PcuVI, I -Pgr IP, I-PobIP, I-PorI r 
I-PorIIP, I-Ppblp, I-Ppol, I-SPBetalP, I-Scal, I-Scel, 
I-Scell, I-SceIII . , I-SceIV, 1-SceV, i-SceVI, I-SceVlI, 

20 I-SexIP, I-SneIP r I-SpomCP, I-SpomIP, I-SpomIIP, I-SquIP, 

I-Ssp6803I> I-SthPhiJP, I-SthPhiST3P, I-SthPhiS3bP, I-TdeIP, 
I-TevI, I-Tevll, I-TevIII, I-UarAP, I-UarHGPAlP, 
I-UarHGPAl3P, I-VinIP, I-ZbiIP, PI-MtuI, PI-MtuHIP f 
PI-MtuHIIP, Pl-Pful, Pl-PfuII, Pl-Pkol, Pl-Pkoll, PI-PspI, 

25 PI-Rma43812lP, PI-SPBetalP, Pl-Scel, PI-Tful, PI-TfuII, 

PI-Thyl, PI-Tlil and PI-Tlill. 

7. A recombination system as claimed in any of claims 1 to 6, 
wherein the enzyme suitable for inducing DNA double-strand 

30 breaks at the recognition sequence for the site-directed 

induction of DNA double-strand breaks is selected from the 
group of the homing endonucleases consisting of the enzymes 
as shown in SEQ ID NO: 2 r 4,6, 8 and 10. 

35 8. A recombination system as claimed in any of claims 1 to 7, 
wherein the enzyme suitable for inducing DNA double-strand 
breaks at the recognition sequence for the site-directed 
induction of DNA double-strand breaks is realized using an 
expression cassette encompassing a nucleic acid sequence 

40 encoding said enzyme . 

9- A recombination system as claimed in any of claims 1 to 8, 
wherein the enzyme suitable for inducing DNA double-strand 
breaks at the recognition sequence for the site-directed 
45 induction of DNA double-strand breaks is realized using an 
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expression cassette encompassing a nucleic acid sequence 
encoding said enzyme as shown in SEQ ID NO: 1, 3, 5, 7 or 9. 

10. A method for removing a DNA sequence from the chromosomal DNA 
5 of a eukaryotic cell or organism, which comprises combining, 
in a eukaryotic cell or organism, 

I) a transgenic recombination construct inserted into the 
chromosomal DNA of a eukaryotic organism comprising a 

10 sequence consisting, in the 5 f /3' -direction, of 

al) a first homology sequence A and 

bl ) at least one recognition sequence for the 
15 site-directed induction of DNA double-strand breaks 

and 

a2) a second homology sequence B, the homology sequences 
A and B having a sufficient length and sufficient 
20 homology in order to ensure homologous recombination, 

together with 

II) an enzyme suitable for inducing DNA double-strand breaks 
25 at the recognition sequence (b) for the site-directed 

induction of DNA double-strand breaks, 

and the induction of DNA double-strand breaks at the 
recognition sequence for the site-directed induction of DNA 
30 double-strand breaks and the homologous recombination taking 
place between the homology sequences A and B. 



35 



40 



11. A method as claimed in claim 10, wherein the recombination 
construct is constructed as follows: 

al) a first homology sequence A and 

bl) a first recognition sequence for the site-directed 
induction of DNA double-strand breaks and 

c) a further nucleic acid sequence and 



a2) a second homology sequence B, the homology sequences A 
and B having a sufficient length and sufficient homology 
45 in order to ensure homologous recombination. 
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12. A method as claimed in claim 10 or 11, wherein the 
recombination construct is constructed as follows: 

al) a first homology sequence A and 

bl) a first recognition sequence for the site-directed 
induction of DNA double-strand breaks and 



10 



c) a further nucleic acid sequence and 

b2) a second recognition sequence for the site-directed 
induction of DNA double-strand breaks and 



a2) a second homology sequence B, the homology sequences A 
15 and B having a sufficient length and sufficient homology 

in order to ensure homologous recombination. 

13. A method as claimed in any of claims 10 to 12, wherein the 
recombination construct or the further nucleic acid sequence 

20 encompasses at least one of the elements selected from the 
group consisting of 

i) positive selection markers 

25 ii) negative selection markers 

iii) reporter genes 

iv) replication origins 

30 

v) multiple cloning regions 

vi) border sequences for Agrobacterium trans fection 

35 vii) sequences which enable homologous recombination or 

insertion into the genome of a host organism 

viii) expression cassette for an enzyme suitable for inducing 
DNA double-strand breaks at the recognition sequence 
40 for the site-directed induction of DNA double-strand 

breaks 

14. A method as claimed in any of claims 10 to 13, which 
comprises that the enzyme suitable for inducing DNA 

45 double-strand breaks at the recognition sequence for the 
site-directed induction of DNA double-strand breaks is 
selected from the group consisting of restriction 
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endonucleases, homing endonucleases, recombinases , 
transposases and chimeric nucleases. 

A method as claimed in any of claims 10 to 14 , wherein the 
enzyme suitable for inducing DNA double-strand breaks at the 
recognition sequence for the site-directed induction of DNA 
double-strand breaks is selected from the group of the homing 
endonucleases consisting of F-Scel, F-SceII f F-Suvl, F-TevI, 
F-TevII, I-Amal, I-Anil, I-Ceul, I-CeuAIIP, I-Chul, I-Cmoel, 
I-Cpal, I-CpaII, I-Crel f I-CrepsbIP, I-CrepsbIIP, 
I-CrepsbIIIP, I-CrepsbIVP f I-CsmI, I-Cvul, I-CvuAIP, I-Ddil, 
I-DdiII f I-Dirl, I-DmoI f I-Hmul, I-HmuII, I-HspNIP r I-Llal, 
I-Msol, I-Naal, I-NanI, I-NclIP, I-NgrIP, X-HitX, I-Njal, 
I-Nsp236lP, I-PakI, I-PboIP, I-PcuIP, I-PcuAI, I-PcuVI, 
I-PgrIP, I-PobIP, I-Porl, I-PorIIP, I-PpbIP, I-Ppol, 
I-SPBetaIP, I-Scal f I-Scel, I-SceII, I-SceIII , I-SceIV, 
I-SceV, I-SceVI, I-SceVII, I-SexIP, I-SneIP r I-SpomCP, 
I-SpomIP, I-SponilIP, I-SquIP, I-Ssp6803I f I-SthPhiJP, 
I-SthPhiST3P, I-SthPhiS3bP, I-TdeIP, I-TevI, I-TevII, 
I-TevIII, I-UarAP, I-UarHGPAlP, I-UarHGPAl3P, I-VinIP, 
I-ZbiIP, PI-MtuI, PI-MtuHIP, PI-MtuHIIP, Pl-Pful, Pl-PfuII, 
Pl-Pkol, Pl-PkoII, PI-PspI, PI-Rma43812IP, PI-SPBetalP r 
Pl-Scel, PI-Tful, PI-TfuII, PI-Thyl, PI-Tlil and PI-Tlill. 

25 16. A method as claimed in any of claims 10 to 15, wherein the 

enzyme suitable for inducing DNA double-strand breaks at the 
recognition sequence for the site-directed induction of DNA 
double-strand breaks is selected from the group of the homing 
endonucleases consisting of the enzymes as shown in SEQ ID 

30 NO: 2, 4, 6, 8 and 10. 

17. A method as claimed in any of claims 10 to 16, wherein the 
enzyme suitable for inducing DNA double-strand breaks at the 
recognition sequence for the site-directed induction of DNA 

35 double-strand breaks is realized using an expression cassette 
encompassing a nucleic acid sequence encoding said enzyme. 

18. A method as claimed in any of claims 10 to 17, wherein the 
enzyme suitable for inducing DNA double-strand breaks at the 

40 recognition sequence for the site-directed induction of DNA 

double-strand breaks is realized using an expression cassette 
encompassing a nucleic acid sequence encoding said enzyme as 
shown in SEQ ID NO: 1, 3, 5, 7 or 9. 

45 19. An organism comprising a recombination system as claimed in 
any of claims 1 to 9. 
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20. An organism as claimed in claim 19 selected from the group 
consisting of yeasts , algae , fungi and animal or plant 
organisms. 

5 21. An organism as claimed in claim 19 or 20 selected from the 
group of the plant organisms. 

An organism as claimed in any of claims 19 or 22, wherein the 
plant organism is selected from the group consisting of 
Arabidopsis thaliana, tobacco, wheat, rye f barley, oats, 
oilseed rape, maize, potato, sugar beet, soybean, sunflower, 
pumpkin/squash or peanut. 

23. A cell culture, organ, tissue, part or transgenic propagation 
15 material derived from an organism as claimed in any of claims 
19 to 22. 

The use of an organism as claimed in any of claims 19 to 22 
or of a cell culture, organ, tissue, part or triansgenic 
propagation material as claimed in claim 23 derived therefrom 
as foodstuff, feedstuff or seed or for the production of 
pharmaceuticals or fine chemicals. 



25 



30 



35 



40 



24. 

20 



45 



CA 02451492 2003-12-18 



(12) NACH DEM VERTRAG UBER DIE INTERNATIONALE ZUSAMMEN ARBEIT AUF DEM GEBEET DES 
PATENTWESENS (PCT) VERdFFENTLICHTE INTERNATIONALE ANMELDUNG 



(19) Weltorganisation fur geistiges Eigentum 
Internationales Buro 

(43) Internationales Veroffentlichungsdatum 
16. Januar 2003 (16.01.2003) 




(10) Internationale Ver6ffentlichungsnummer 

PCT WO 03/004659 A3 



(51) Internationale Pa ten tklassi filiation 7 : C12N 15/90, 

15/82, 5/00, A01K 67/027 



(21) Internationales Aktenzeichen: 

(22) Internationales Anmeldedatum: 



PCT/EP02/07281 



(25) Einreichungssprache: 

(26) Verflffentlichungssprache: 



2. Juli 2002 (02.07.2002) 
Deutsch 
Deutsch 



(30) Angaben zur Prior itat: 

101 31786.7 4. Juli 2001 (04.07.2001) DE 

(71) Anmelder (fur alle Bestimmungsstaaten mit Ausnahme 
von US): SUNGENE GMBH & CO. KGAA [DE/DE]; 
Corrensstrasse 3, 06466 Gatersleben (DE). INSTITUT 
F. PFLANZENGENETIK [DE/DE]; u. Kulturpflanzen- 
forschung, Corrensstr. 3, 06466 Gatersleben (DE). 

(72) Erfinder; und 

(75) Erfinder/Anmelder (nur fur US): PUCHTA, Holger 

[DE/DE]; Uber der Eine 16a, 06449 Aschersleben (DE). 
BIESGEN, Christian [DE/DE]; Reichenstr. 29, 06484 
Quedlinburg (DE). 

(74) Anwalt: BIEBERBACH, Andreas; BASF Aktienge- 
sellschaft, 67056 Ludwigshafen (DE). 



(81) Bestimmungsstaaten (national): AE, AG, AL, AM, AT, 
AU, AZ, B A, BB, BG, BR, BY, BZ, CA, CH, CN, CO, CR, 
CU, CZ, DE, DK, DM, DZ, EC, EE, ES, FI, GB, GD, GE, 
GH, GM, HR, HU, ID, IL, IN, IS, JP, KE, KG, KP, KR, 
KZ, LC, LK, LR, LS, LT, LU, LV, MA, MD, MG, MK, 
MN, MW, MX, MZ, NO, NZ, OM, PH, PL, PT, RO, RU, 
SD, SE, SG, SI, SK, SL, TJ, TM, TN, TR, TT, TZ, UA, UG, 
US, UZ, VN, YU, ZA, ZM, ZW. 

(84) Bestimmungsstaaten (regional): ARIPO-Patent (GH, 
GM, KE, LS, MW, MZ, SD, SL, SZ, TZ, UG, ZM, ZW), 
eurasisches Patent (AM, AZ, BY, KG, KZ, MD, RU, TJ, 
TM), europaisches Patent (AT, BE, BG, CH, CY, CZ, DE, 
DK, EE, ES, FI, FR, GB, GR, IE, IT, LU, MC, NL, PT, 
SE, SK, TR), OAPI-Patent (BF, BJ, CF, CG, CI, CM, GA, 
GN, GQ, GW, ML, MR, NE, SN, TD, TG). 

VerSffentlicht: 

— mit internationalem Recherche nbericht 



(88) Veroffentlichungsdatum des internationalen 

Recherchenberichts: 25. September 2003 

Zur Erkldrung der Zweibuchstaben-Codes und der anderen 
Abkurzungen wird auf die Erkldrungen ("Guidance Notes on 
Codes and Abbreviations") am Anfangjeder reguldren Ausgabe 
der PCT-Gazette verwiesen. 



^ (54) Title: RECOMBINATION SYSTEMS AND A METHOD FOR REMOVING NUCLEIC ACID SEQUENCES FROM THE 
^ GENOME OF EUKARYOTIC ORGANISMS 

ON (54) Bezeichnung: REKOMBINATIONSSYSTEME UND VERFAHREN ZUM ENTFERNEN VON NUKLEINSAURESE- 
QUENZEN AUS DEM GENOM EUKARYOTISCHER ORGANISMEN 

O (57) Abstract: The invention relates to recombination systems and to a method for removing nucleic acid sequences from the chro- 
mosomal DNA of eukaryotic organisms. The invention also relates to transgenic organisms (preferably plants), containing said 
f^J systems or produced by said method. 

^ (57) Zusammenfassung: Die Erfindung betrifft Rekombinationssysteme und Verfahren zum Entfernen von Nukleihsauresequenzen 
aus der chromosomalen DNA eukaryorischer Organismen, sowie transgene Organismen - bevorzugt Pflanzen - die diese Systeme 
^ enthalten bzw. mit diesen Verfahren hergestellt wurden. 



